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METHODS FOR DIAGNOSING, PREVENTING, AND TREATING 
DEVELOPMENTAL DISORDERS DUE TO A COMBINATION OF 
GENETIC AND ENVIRONMENTAL FACTORS 

CROSS-REFERENCE TO RELATED APPLICATIONS 

5 The present application is a non-provisional application claiming the priority of 

copending provisional U.S. Serial No. 60/136,198 filed May 25, 1999, the disclosure 
of which is hereby incorporated by reference in its entirety. Applicants claim the 
benefits of this application under 35 U.S.C. §1 19(e). 



10 The invention relates generally to novel methods of diagnosing, preventing, and 
treating specific diseases which are caused by a combination of genetic and 
environmental factors. One such disease exemplified is schizophrenia. 



The term "schizophrenia" was introduced by Bleuler in the beginning of this century 
15 to encompass a dissociation or disruption of thought processes, along with a 

dichotomy among thought, emotion, and behavior [Bleuler, Translation J. Zinkin, 
New York: International University Press (1950)]. The current definition of 
schizophrenia includes a break with reality that is usually manifested as 
hallucinations, delusions, or disruption in thought processes [Carpenter et al, Medical 
20 Progress, 330:681-690 (1994)]. At present the nationally accepted definition for the 
diagnosis of schizophrenia is contained in Diagnostic and Statistical Manual for 
Mental Disorders, Fourth Edition, Washington, D.C (1994): American Psychiatric 
Association, hereby incorporated by reference in its entirety. 

Schizophrenia is a clinical syndrome that has a profound influence on public health. 
25 The symptoms for schizophrenia begin early in life, and continues for most patients 
throughout their lives. An estimate of the direct and indirect costs of schizophrenia 
was thirty-three billion dollars for 1990 in the United States alone [Carpenter et al, 
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1994, supra]. Indeed, one of every forty dollars spent for total heath care 
expenditures in the United States is spent on treating schizophrenia [Rupp et ah, 
Psychiatric Clin, North Am., 16:413-423 (1993)]. Furthermore, estimates have been 
made suggesting that up to 50% of the homeless American population is 
5 schizophrenic [Bachrach, In: Treating the Homeless Mentally III, Washington, D.C., 
American Psychiatric Press, 13-40, Lamb et ah ed. (1992)]. 



The genetic factors in schizophrenia, though clearly documented to be present, are not 
O simple [Carpenter and Buchanan, N, Engl. J, Med., 330:681-689 (1994); Gottesman, 

Clin. Genet., 46:1 16-123 (1994)]. Schizophrenia is, at least in part, a 
10 neurodevelopmental disorder, a birth defect in which the brain has been subtly 
ru damaged during development [Carpenter and Buchanan, N. Engl. J. Med., 330:681- 

% 689 (1994); Weinberger, Arch. Gen. Psychiatry, 44:660-669 (1987); Brixey et ah, J. 

^ Clin. Psychol., 49:447-456 (1993)]. Evidence of this damage is seen both at autopsy 

yn [Kovelman and Scheibel, Biol. Psychiatry, 19:1601-1621 (1984); Bogerts et ah, Arch. 

15 Gen. Psychiatry, 42:784-791 (1985); Jakob and Beckman, J. Neural Transm., 
p 65:303-326 (1986); Brown et ah, Arch. Gen. Psychiatry, 43:36-42 (1986); Benes and 

^ Bird, Arch Gen Psychiatry, 44:608-616 (1987); Colter et ah, Arch Gen Psychiatry, 

44:1023 (1987); Altshuler et ah, Arch. Gen. Psychiatry, 47:1029-1034 (1990); 
Pakkenberg, Schizophr. Res., 7:95-100 (1992); Bogerts, Schizophr. Bull., 19:431-445 
20 (1993); Shapiro, Schizophr. Res., 10:187-239 (1993)] and by neuroimaging [Jeste et 
ah, Br. J. Psychiatry, 153:444-459 (1988); Suddath et ah, Am. J, Psychiatry, 
146:464-472 (1989); Suddath etah, N Engl. J. Med., 322:1%9-194 (1990); DeLisi et 
ah, Bioh Psychiatry, 29:159-175 (1991); Breier et ah, Arch, Gen, Psychiatry, 49:921- 
926 (1992); O'Callaghan et ah, J. R. Soc, Med., 85:227-231 (1992); Bogerts et ah, 
25 Biol. Psychiatry, 33:236-246 (1993); Andreasen et ah, Science, 266:294-298 (1994)]. 
The pattern of this brain damage and the presence of minor congenital abnormalities 
point to an insult occurring during the second trimester of fetal development [Bracha 
et ah, Biol, Psychiatry, 30:719-725 (1991); Bracha et ah, Am. J. Psychiatry, 
149:1355-1361 (1992); Green etah, Psychiatry Res., 53:119-127 (1994)]. 
30 Epidemiological studies have documented a season-of-birth effect by which 

schizophrenics are more frequently born during winter and early spring than during 
other seasons [Boyd et ah, Schizophr. Bulh, 12:173-186 (1986); Kendell and Adams, 



Br. J. Psychiatry, 158:758-763 (1991); O'Callaghan etal, Br. J. Psychiatry, 
158:764-769 (1991)]. Also, individuals exposed to an influenza epidemic [Mednick 
et al, Arch. Gen. Psychiatry, 45:189-192 (1988); Barr et al, Arch. Gen. Psychiatry, 
47:869-874 (1990); O'Callaghan etal.. Lancet., 337:1248-1250 (1991); Murray et 
5 al, J. Psychiatr. Res., 26:225-235 (1992); Adams et al, Br. J. Psychiatry, 163:522- 
534 (1993)] or famine [Susser and Lin, Arch. Gen. Psychiatry, 49:983-988 (1992)] 
during their second trimester of fetal development have increased risk of later 
developing schizophrenia, according to some studies but not others [Kendell, Arch. 
Gen. Psychiatry, 46:878-882 (1989); Crow and Done, Br. J. Psychiatry, 161:390-393 
10 (1992)]. This has suggested that an environmental effect such as dietary deficiency, 
Lfl virus infection [Kirch, Schizophr. Bull, 19:355-370 (1993)], vitamin deficiency, or 

sl effect of cold weather may be acting during fetal development. 

0J 

01 Linkage mapping studies in schizophrenia have been difficult. Recently, some 

studies [Straub et al, Nature Genet, 11:287-293 (1995); Schwab et al, Nature 
W\ 15 Genet., 11:325-327 (1995); Moises et al, Nature Genet, 11:321-324 (1995)] have 

supported a gene locus on chromosome 6 (6p24-22, near the HLA region) as having 
an effect in schizophrenia; other studies gave little or no support to a marker in this 
region [Wang et al, Nature Genet, 10:41-46 (1995); Mowry et al, Nature Genet, 
11:233-234 (1995); Gurling et al, Nature Genet, 11:234-235 (1995); Antonarakis et 
20 al, Nature Genet., 11:235-236 (1995)]. At best this locus appeared to be involved in 
only about 15-30% of families [Straub et al, 1995, supra]. Also, some evidence for 
loci on chromosomes 3 [Pulver et al, Am. J. Med. Genet., 60:252-260 (1995), 8 
[Pulver et al, Am. J. Med. Genet, 60:252-260 (1995); Kendler et al, Am. J. Psych. 
153:1534-1540 (1996), 9 [Coon etal, Biol. Psychiatry, 34:277-289 (1993); Moises et 
25 al, Nature Genet, 11:321-324 (1995)] and 22 [Coon et al, Am. J. Med. Genet, 
54:72-79 (1994); Pulver et al, Am. J. Med. Genet, 54:3-43 (1994)]have been 
reported. In addition, two polymorphic markers very close to the gene encoding 
dihydrofolate reductase (DHFR) on chromosome 5q, D5S76 and D5S39, gave very 
high lod scores (as high as 6.49, i.e. odds of about 3 million to one in favor of genetic 
30 linkage versus chance occurrence) in 7 British and Icelandic schizophrenia families 
studied [Schwab et al, Nat Genet. 11:325-327 (1997); Straub et al, Molec 
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Psychiatr. 2:148-155 (1997)]. However, this result could not be confirmed in studies 
of numerous other families. 

There could be several reasons for this difficulty. First, there may be more than one 
gene involved, (locus heterogeneity). Second, the genetic factor(s) may be common 
5 in the population (high disease allele frequency), thus diminishing the power of 
linkage studies [Terwilliger and Ott, Handbook of Human Genetic Linkage, 
Baltimore: Johns Hopkins Univ. Pr., 181 (1994)]. Third, the correct genetic model 
may be unknown [Owen, Psychol Med., 22:289-293 (1992)]. Any or all of these 
factors could diminish the power of a linkage study sufficiently to make success very 
10 difficult [Terwilliger and Ott, 1994, supra]. 



m Thus the current (developmental) model for schizophrenia is that genetic and 

;L environmental factors cause brain damage in a fetus that later develops schizophrenia. 

yi However, the genetic and environmental factors have not been identified. Also, 

Pi 

extensive linkage and association studies have failed to identify genes determining 

M 15 schizophrenia. 

O 

Indeed, schizophrenia appears to be just one of a family of developmental disorders 
whose cause has not been identified. Other such developmental disorders are defined 
by the Diagnostic and Statistical Manual for Mental Disorders, Fourth Edition, 
Washington, D.C (1994) and include: Tourette Syndrome which is identical to 

20 Tourette's Disorder and is a subcategory of Tic Disorders; Bipolar Disorder which is 
identical with Bipolar I Disorder or Bipolar II disorder; Autism which is identical 
with Autistic Disorder which is a subcategory of Pervasive Developmental Disorders; 
Conduct disorder which is a subcategory of Attention-Deficit and Disruptive 
Behavioral Disorders; Attention-Deficit Hyperactivity Disorder which is identical to 

25 Attention-Deficit/Hyperactivity Disorder and to Attention-Deficit/Hyperactivity 
Disorder NOS (not otherwise specified) which is also a subcategory of Attention- 
Deficit and Disruptive Behavioral Disorders; Obsessive-Compulsive Disorder which 
is a subtype of Anxiety Disorders; Chronic Multiple Tics Syndrome which is identical 
to Chronic Motor or Vocal Tic Disorder which is a subtype of Tic Disorders; and 

30 Learning Disorders. 
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In addition Spina bifida is a developmental disorder. Spina bifida is a form of neural 
tube defect in which neural elements (spinal nerves or spinal chord) or coverings of 
the brain and spinal chord (dura mater, arachnoid mater) herniate through a midline 
defect into a cystic cavity covered completely or partially by skin. 

Therefore, there is a need for new methods of diagnosing individuals susceptible to 
developing a developmental disorder. In addition, there is a need for methods of 
identifying individuals susceptible to having offspring that develop a developmental 
disorder. Finally, there is a need for a method of treating such susceptible individuals 
in order to prevent and/or ameliorate the symptoms due to and/or associated with the 
developmental disorder. 

The citations of any reference herein should not be construed as an admission that 
such reference is available as 'Trior Art" to the instant application. 

SUMMARY OF THE INVENTION 

The present invention provides methods of diagnosing, preventing and/or treating 
15 specific developmental disorders. Towards this end the present invention provides 
methods of identifying an individual as being genetically or environmentally 
susceptible for developing or having a developmental disorder or for having offspring 
that develop the developmental disorder. Such a developmental disorder can be 
schizophrenia, spina bifida cystica, Tourette's syndrome, bipolar illness, autism, 
20 conduct disorders, attention deficit disorder, obsessive compulsive disorder, chronic 
multiple tic syndrome and learning disorders such as dyslexia. In addition, any of the 
methods provided herein for identifying an individual as being genetically and/or 
environmentally susceptible for having or developing a developmental disorder or for 
having offspring that develop the developmental disorder can also be used in 
25 diagnosing the individual, preferably in conjunction with a clinical diagnosis. 

Therefore, the present invention provides methods of identifying an individual as 
being genetically susceptible for having or developing a developmental disorder. 
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The present invention further provides methods of identifying an individual as being 
genetically susceptible for having offspring that are susceptible for developing a 
developmental disorder. Methods of identifying an individual as being susceptible 
due to environmental factors for having or developing a developmental disorder are 
5 also provided. In addition, the present invention provides methods of identifying an 
individual as being susceptible of having offspring that are susceptible for developing 
a developmental disorder. The present invention also provides methods of identifying 
an individual as being susceptible for having or developing a developmental disorder 
due to both environmental and genetic factors. The present invention further provides 
10 methods of identifying an individual as being susceptible for having offspring that are 
susceptible for developing a developmental disorder 

The present invention therefore provides methods for compiling genetic reference 
datasets, environmental reference datasets and/or genetic and environmental reference 
datasets for use in determining a predicted probability for an individual of having a 
15 susceptibility for having or developing a developmental disorder, or for having 
offspring that develop a developmental disorder. 

In one aspect of the invention, the present invention provides methods that comprise 
generating a genetic reference dataset for use in determining the predicted probability 
of an individual for having a susceptibility for having or developing a developmental 
20 disorder due to genetic factors, or for having offspring that develop a developmental 
disorder due to genetic factors. 

One such embodiment comprises collecting a biological sample from a human 
subject. The human subject can be a diagnostic proband, a blood relative of the 
diagnostic proband, an affected proband, a blood relative of the affected proband, a 

25 control proband, and/or a blood relative of the control proband. The biological 

sample contains nucleic acids and/or proteins from the human subject. The nucleic 
acids and/or proteins from the biological sample are then analyzed resulting in a 
partial or full genotype for the alleles of the genes involved in folate, pyridoxine, 
and/or cobalamin metabolism. The partial or full genotype then forms a dataset of 

30 genetic explanatory variables for the human subject. The dataset of genetic 
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explanatory variables is then compiled from multiple human subjects into a genetic 
reference dataset. Such compilations are exemplified in the Detailed Description and 
Examples below. 

In another aspect, the present invention provides a method that comprises generating 
5 a genetic and environmental reference dataset for use in determining the predicted 
probability of an individual for having a susceptibility for having or developing a 
developmental disorder due to genetic factors and environmental factors, or for 
having offspring that develop a developmental disorder due to genetic factors and 
i» s environmental factors. One such embodiment comprises obtaining dietary and 

10 epidemiological information for environmental explanatory variables for the human 
SI subjects and combining the environmental explanatory variables with a genetic 

n ] reference dataset for the human subjects as described above. 

m 

nil 

s In another aspect, the present invention provides an environmental reference dataset 

for use in the determination of the predicted probability for an individual for having a 
0J 1 5 susceptibility for having or developing a developmental disorder due to 

Q environmental factors, or for having offspring that develop a developmental disorder 

^ due to environmental factors One such embodiment comprises obtaining dietary and 

epidemiological information for environmental explanatory variables for a human 
subject. The human subject can be a diagnostic proband, a blood relative of the 
20 diagnostic proband, an affected proband, a blood relative of the affected proband, a 
control proband, or a blood relative of the control proband. The dataset of 
environmental explanatory variables is then compiled from multiple human subjects 
into an environmental reference dataset for the human subjects. 

The developmental disorder forming the basis of the reference datasets of the present 
25 invention can be schizophrenia, or spina bifida cystica, or Tourette's syndrome, or 
dyslexia, or conduct disorder, or attention-deficit hyperactivity disorder, or bipolar 
illness, or autism, or chronic multiple tic syndrome or obsessive-compulsive disorder, 
or like disorders. A blood relative is preferably the mother of the individual, a 
sibling, the father or a grandparent of the individual. When the reference dataset is 
30 for use in the determination of the predicted probability for an individual of having a 
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susceptibility for having offspring that develop a developmental disorder, the 
individual is preferably a pregnant woman. The reference datasets of the present 
invention are themselves part of the present invention. 



The present invention further provides methods of estimating the genetic 
5 susceptibility of an individual to have or to develop a developmental disorder, or to 
have offspring that develop a developmental disorder. In one such embodiment the 
method comprises collecting a biological sample from a participant (or participants) 
who is either the individual or a blood relative of the individual. The biological 
4s sample contains nucleic acids and/or proteins of the participant. The analysis of the 

Zj 10 nucleic acids and/or proteins from the biological sample yield a partial or full 

2i genotype for the alleles of the genes involved in folate, pyridoxine, and/or cobalamin 

i y 

01 metabolism. The partial or full genotype forms a dataset of genetic explanatory 

j** variables for the participants. The dataset of genetic explanatory variables obtained 

P are added to a genetic reference dataset forming a combined genetic dataset. A model 

E p 

PI j 15 is then formulated comprising the genetic explanatory variables obtained from the 

J4 participants and the combined genetic dataset is analyzed. A predicted probability for 

Cj the individual for having and/or developing a developmental disorder and/or having 

offspring that develop a developmental disorder is then determined. The genetic 
susceptibility of an individual to have or to develop a developmental disorder and/or 
20 have offspring that develop a developmental disorder is estimated. In a preferred 
embodiment, analyzing the combined genetic dataset is performed by binary linear 
regression. In a more preferred embodiment, the binary linear regression is 
performed with the SAS system. In another preferred embodiment, the model is 
modified by adding or subtracting one or more genetic explanatory variables and the 
25 combined genetic dataset is re-analyzed, preferably by binary logistic regression. In 
this case a model is chosen that best fits the data. This can be accomplished by 
testing the model for goodness of fit. 

The present invention also provides methods of estimating the genetic and 
environmental susceptibility of an individual to have or to develop a developmental 
30 disorder and/or for having offspring that develop a developmental disorder. One such 
embodiment comprises collecting a biological sample from one or more participants. 
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Again, the participant is either the individual or a blood relative of the individual. 
The biological sample contains nucleic acids and/or proteins of the participant. The 
nucleic acids and/or proteins from the biological sample are analyzed resulting in a 
partial or full genotype for the alleles of the genes involved in folate, pyridoxine, 
5 and/or cobalamin metabolism. The partial or full genotype forms a dataset of genetic 
explanatory variables for the participant. Dietary and epidemiological information 
for environmental explanatory variables for the participant(s) are also obtained which 
' are used to form a dataset of environmental explanatory variables for the 
participant(s). The datasets of genetic explanatory variables and the dataset of 

10 environmental explanatory variables are added to a genetic and environmental 

reference dataset forming a combined genetic and environmental dataset. A model is 
formulated comprising the genetic and environmental explanatory variables obtained 
from the participant(s). The combined genetic and environmental dataset is then 
analyzed and a predicted probability for the individual for having and/or developing a 

15 developmental disorder and/or for having offspring that develop a developmental 

disorder is determined. The genetic and environmental susceptibility of an individual 
to have or to develop a developmental disorder and/or have offspring that develop a 
developmental disorder is estimated. In a preferred embodiment, analyzing the 
combined genetic and environmental dataset is performed by binary linear regression. 

20 In a more preferred embodiment the binary linear regression is performed with the 
SAS system. In another preferred embodiment the model is modified by adding or 
subtracting one or more genetic and/or environmental explanatory variables and the 
combined genetic and environmental dataset is re-analyzed preferably, by binary 
logistic regression. In this case a model is chosen that best fits the data. This can be 

25 accomplished by testing the model for goodness of fit. 

For any of these methods, the developmental disorder can be schizophrenia, spina 
bifida cystica, Tourette's syndrome, bipolar illness, autism, conduct disorder, 
attention deficit hyperactivity disorder, obsessive compulsive disorder, chronic 
multiple tic syndrome and learning disorders such as dyslexia. 



30 In a particular embodiment, the individual is suspected of being genetically 

susceptible of having or for developing the developmental disorder and/or of being 



10 

genetically susceptible of having offspring that develop the developmental disorder. 
In a preferred embodiment of this type, the individual is suspected of being 
genetically susceptible for having or for developing the developmental disorder 
and/or of being genetically susceptible of having offspring that develop the 
5 developmental disorder because a blood relative has the developmental disorder. In 
one such embodiment the blood relative is a parent, a sibling, or a grandparent. In a 
preferred embodiment the blood relative is the mother of the individual. In a 
particular embodiment in which the individual is suspected of being genetically 
susceptible of having offspring that develop the developmental disorder, the 
aj 1 0 individual is a pregnant woman. In another such embodiment the individual is the 

mate of the pregnant woman. In a particular embodiment exemplified below, the 
developmental disorder is schizophrenia. 
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Since the availability of the data regarding the genetic and environmental explanatory 

G3 factors can vary in separate determinations, variations in the explanatory factors used 

Ul 

m 15 is clearly envisioned by the present invention. 

yj 

iL. J 

p The present invention further provides methods of lowering the risk of a pregnant 

woman to have a child that will develop a developmental disorder. One such 
embodiment comprises administering methylfolate, cobalamin or pyridoxine to the 
pregnant woman and/or fetus, which lowers the risk of the pregnant woman to give 

20 birth to a child with a developmental disorder. In a particular embodiment of this 

type, the pregnant woman had been previously determined to be susceptible of having 
offspring that develop a developmental disorder by a method disclosed herein. The 
present invention further provides a method of determining if any treatment is 
advisable for a pregnant woman that is genetically susceptible to having offspring that 

25 develop a developmental disorder which comprises determining the concentration of 
a risk factor from a tissue sample or body fluid from the pregnant woman. When the 
concentration of the risk factor is statistically above or below an accepted normal 
range, treatment is advisable. 
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The present invention further provides methods of determining if any treatment is 
advisable for a pregnant woman who has been determined to be susceptible to having 
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offspring that develop a developmental disorder. One such embodiment comprises 
determining the concentration of a risk factor from a tissue sample or body fluid from 
the pregnant woman. When the concentration of the risk factor is statistically above 
or below an accepted normal range, treatment is advisable. In a particular 
5 embodiment of this type, the pregnant woman had been previously determined to be 
susceptible of having offspring that develop a developmental disorder by a method 
disclosed herein. 

Methods of monitoring the effect of the administration of methylfolate, cobalamin or 
pyridoxine to the pregnant woman who has been determined to be susceptible to 

10 having offspring that develop a developmental disorder are also included in the 

present invention. One such embodiment comprises determining the concentration of 
a risk factor from a tissue sample or body fluid from the pregnant woman. When the 
concentration of the risk factor is statistically within an accepted normal range, the 
treatment is deemed effective. In a particular embodiment of this type, the pregnant 

15 woman had been previously determined to be susceptible of having offspring that 
develop a developmental disorder by a method disclosed herein. The risk factor can 
be any substance and/or metabolite linked to folate and/or cobalamin and/or 
pyridoxine metabolism. In one embodiment, the risk factor is homocysteine. In yet 
another embodiment, the risk factor is folate. In still another embodiment, the risk 

20 factor is cobalamin. 

The present invention also provides a method of treating an asymptomatic individual 
determined to be susceptible for developing a developmental disorder comprising 
administering methylfolate, cobalamin and/or pyridoxine. In a particular embodiment 
of this type, the asymptomatic individual had been previously determined to be 
25 susceptible of developing a developmental disorder by a method disclosed herein. 



The DNA samples from the persons tested may be obtained from any source 
including blood, a tissue sample, amniotic fluid, a chorionic villus sampling, 
cerebrospinal fluid, and urine. 
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The present invention includes but is not limited to the examples of proteins encoded 
by genes involved in folate, cobalamin and pyridoxine metabolism compiled in 
Tables 2-7 in the Detailed Description of the Invention, below. For certain genes 
nucleic acid and/or amino acid sequence data is also provided. These genes and 
5 related sequence data are solely intended as examples of genes that are suitable to be 
used in the methods described herein. Such sequence data can be used for carrying 
out the genetic analysis of the present invention. However, the present invention is 
not intended to be limited in any way to such lists of proteins or the related sequence 
P % data. 

Si 10 It is further contemplated by the present invention to provide methods that include the 

^ t testing for a genetic mutations in individual genes involved in folate and cobalamin 

0 1 metabolism and/or in individual combinations of such genes (e.g. , 

s methylenetetrahydrofolate reductase gene and methionine synthase). In addition, all 

possible combinatorial, and permutations of such genes including a constellation 

y = 

Hi 15 comprising all of the genes involved in folate, pyridoxine, and cobalamin metabolism 

yj 

f?j is envisioned by the present invention. Alternatively, a constellation of genes in 

^ which any one or more genes can be excluded from those tested is also contemplated 

by the present invention (for example, a given constellation of genes can include 
genes encoding all of the proteins in Table 2 and 4 except the folate receptor 2-like 
20 protein). Thus all of such possible constellations are envisioned by, and are therefore 
part of the present invention. 

The present invention also provides DNA polymorphisms that can be used as genetic 
explanatory factors in the present invention. One such embodiment is a nucleic acid 
encoding a genetic variant of human dihydro folate reductase comprising a nucleotide 
25 sequence having a 19 base-pair deletion spanning nucleotides 540 to 558 of the 

nucleotide sequence of SEQ ED NO:41 . In a preferred embodiment the nucleic acid 
has the nucleotide sequence of SEQ ID NO: 42. 
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The present invention also includes primers. One such embodiment is a PCR primer 
that can be used to distinguish SEQ ID NO:42 from SEQ ID NO:41. Another 
embodiment is a PCR primer that can be used to distinguish SEQ ID NO:42 from 
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SEQ ED NO:45. These primers are useful for identifying the 19 base-pair deletion 
spanning nucleotides 540 to 558 of the nucleotide sequence of SEQ ID NO:41 (see 
Example 2). In a particular embodiment, the PCR primer comprises 8 to 100 and 
preferably 10 to 50 consecutive nucleotides from the nucleotide sequence of SEQ ID 
5 NO:41. In another embodiment the PCR primer comprises 8 to 100 and preferably 10 
to 50 consecutive nucleotides from the nucleotide sequence of the complementary 
strand of SEQ ID NO:41. In still another embodiment the PCR primer comprises 8 to 
100 and preferably 10 to 50 consecutive nucleotides from the nucleotide sequence of 
SEQ ID NO:42. In yet another embodiment the PCR primer comprises 8 to 100 and 
~ 10 preferably 10 to 50 consecutive nucleotides from the nucleotide sequence of the 

43 complementary strand of SEQ ID NO:42. In still another embodiment the PCR 

Sj primer comprises 8 to 100 and preferably 10 to 50 consecutive nucleotides from the 

2^ nucleotide sequence of SEQ ID NO:45. In yet another embodiment the PCR primer 

CP comprises 8 to 100 and preferably 10 to 50 consecutive nucleotides from the 

01 

- 15 nucleotide sequence of the complementary strand of SEQ ID NO:45 . 

yi 

f!i In a particular embodiment the PCR primer comprises 8 to 100 and preferably 10 to 

hi 

50 consecutive nucleotides from nucleotides 350 to 530 of SEQ ID NO:41. In a 
^ preferred embodiment of this type, the PCR primer has the nucleotide sequence of 

CTAAACTGCATCGTCGCTGTG (SEQ ID NO:38). In another particular 

20 embodiment the PCR primer comprises 8 to 100 and preferably 10 to 50 consecutive 
nucleotides from the complementary strand of nucleotides 550 to 850 of SEQ ID 
NO:41. In preferred embodiment of this type, the PCR primer comprises 8 to 100 
and preferably 10 to 50 consecutive nucleotides from the complementary strand of 
nucleotides 570 to 690 of SEQ ID NO:41 . In a particular embodiment, the PCR 

25 primer has the nucleotide sequence of AAAAGGGGAATCCAGTCGG (SEQ ID 
NO:39). 



The present invention also provides a nucleic acid that hybridizes under standard 
hybridization conditions to the nucleotide sequence ACCTGGGCGGGACGCGCCA 
(SEQ ID NO:40). In another embodiment the nucleic acid hybridizes under standard 
30 hybridization conditions to the nucleotide sequence complementary to SEQ ID 
NO:40. In yet another embodiment the nucleic acid hybridizes under standard 
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hybridization conditions to the nucleotide sequence ACCTGGGCGGGACGCGCC 
(SEQ ID NO:46). In yet another embodiment the nucleic acid hybridizes under 
standard hybridization conditions to the nucleotide sequence complementary to SEQ 
ID NO:46. In a particular embodiment the nucleic acid consists of 9 to 96 
5 nucleotides. In another embodiment the nucleic acid consists of 12 to 48 nucleotides. 
In still another embodiment the nucleic acid consists of 15 to 36 nucleotides. In a 
preferred embodiment the nucleic acid consists of 17 to 20 nucleotides. 
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The present invention also provides a nucleic acid that hybridizes to the nucleotide 
sequence of SEQ ID NO:41 5 but not to the nucleotide sequence of SEQ ID NO:42 
10 when the hybridization is performed under identical conditions. In a particular 
embodiment the nucleic acid comprises the nucleotide sequence of 
CCCACGGTCGGGGTACCTGGGCGGGACGCGCCAGGCCGACTCCCGGCGA 
(SEQ ID NO:29). The present invention further provides a nucleic acid that 
!L hybridizes to the nucleotide sequence of SEQ ID NO:42 5 but not to the nucleotide 

yl 15 sequence of SEQ ID NO:41 when the hybridization is performed under identical 

conditions. In a particular embodiment the nucleic acid comprises the nucleotide 
sequence of CCCACGGTCGGGGTGGCCGACTCCCGGCGA (SEQ ID NO:37). 

In a related embodiment the present invention provides an isolated nucleic acid that 
hybridizes to the complementary strand of the nucleotide sequence of SEQ ID NO:42, 
20 but not to the complementary strand of the nucleotide sequence of SEQ ID NO:41 
when the hybridization is performed under identical conditions. In still another 
embodiment the nucleic acid hybridizes to the nucleotide sequence of SEQ ID 
NO:41, but not to the nucleotide sequence of SEQ ID NO:42 when the hybridization 
is performed under identical conditions. In still another embodiment the nucleic acid 
25 hybridizes to the complementary strand of the nucleotide sequence of SEQ ID NO:41 5 
but not to the complementary strand of the nucleotide sequence of SEQ ID NO:42 
when the hybridization is performed under identical conditions. 



30 



The present invention also provides a nucleic acid that hybridizes to the nucleotide 
sequence of SEQ ID NO:42, but not to the nucleotide sequence of SEQ ID NO:45 
when the hybridization is performed under identical conditions. In a related 
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embodiment the present invention provides an isolated nucleic acid that hybridizes to 
the complementary strand of the nucleotide sequence of SEQ ID NO:42, but not to 
the complementary strand of the nucleotide sequence of SEQ ID NO:45, when the 
hybridization is performed under identical conditions. In still another embodiment 
5 the nucleic acid hybridizes to the nucleotide sequence of SEQ ID NO:45, but not to 
the nucleotide sequence of SEQ ID NO:42 when the hybridization is performed under 
identical conditions. In still another embodiment the nucleic acid hybridizes to the 
complementary strand of the nucleotide sequence of SEQ ID NO:45, but not to the 
complementary strand of the nucleotide sequence of SEQ ID NO:42 when the 
10 hybridization is performed under identical conditions. 

yi 

t j The present invention also provides for the use of the nucleic acids of the present 

^ 4 invention (as well as other nucleic acids which can be used to identify DNA 

Hi 

yi polymorphisms in the alleles of the genes involved in folate, pyridoxine, and/or 

01 

2 cobalamin metabolism) in the methods of the present invention for identifying, 
W 15 diagnosing, preventing and/or treating individuals. 

%# -: 

ni s 

3 Ss? 

SI In methods of estimating the susceptibility due to genetic or genetic and 

CJ environmental factors for an individual to have or to develop a developmental 

disorder or to have offspring that develop a developmental disorder, and for the 
corresponding methods of generating genetic, or genetic and environmental reference 
20 datasets, the present invention provides a step of analyzing nucleic acids and/or 

proteins from biological samples. In one particular embodiment, the assaying for the 
presence of the genetic variant of human dihydro folate reductase having a nucleotide 
sequence with a 19 base-pair deletion spanning nucleotides 540 to 558 of the 
nucleotide sequence of SEQ ID NO:41 is included as part of this analysis. This 
25 genetic variant of human dihydrofolate reductase becomes a genetic explanatory 
variable. 



Determining if the biological sample contains the genetic variant of human 
dihydrofolate reductase having a nucleotide sequence with a 19 base-pair deletion 
spanning nucleotides 540 to 558 of the nucleotide sequence of SEQ ID NO:41 can be 
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performed by any appropriate method including PCR, special PCR, RT PCR, RFLP 
analysis, SSCP, and FISH. 

In addition, all of the nucleic acids of the present invention including cDNA or 
genomic DNA can be placed into expression vectors operably associated with an 
expression control sequence. Alternatively, when the nucleic acid is part of an 
expression control sequence, the nucleic acid and/or the expression control sequence 
can be placed into an expression vector to control the expression of a coding 
sequence, such as a reporter gene. Such expression vectors can then be placed into 
either eukaryotic or prokaryotic host cells and expressed. The host cells comprising 
the expression vectors are also part of the present invention. In addition, when the 
nucleic acid includes a coding sequence or a part of a coding sequence, the present 
invention includes methods of purifying the gene products from the coding sequence 
or part thereof, and the purified gene products themselves. 

Accordingly, it is a principal object of the present invention to provide a method for 
identifying an individual that is genetically inclined to develop a developmental 
disorder or disease. 

It is a further object of the present invention to provide a method for identifying an 
individual that is genetically inclined to develop schizophrenia. 

It is a further object of the present invention to provide a method for identifying an 
individual that is genetically inclined to have offspring having a developmental 
disorder. 

It is a further object of the present invention to provide a method of diagnosing 
schizophrenia. 

It is a further object of the present invention to provide a method of treating 
developmental disorders such as schizophrenia. 
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It is a further object of the present invention to provide a method for monitoring the 
treatment of the developmental disorder. 

It is a further object of the present invention to provide a method for ameliorating the 
effect of a defect in folate, pyridoxine or cobalamin metabolism on a fetus due to the 
5 genetic or environmental status of a pregnant woman. 

It is a further object of the present invention to provide a method of treating a patient 
who is genetically inclined to develop a developmental disorder such as 
schizophrenia. 



It is a further object of the present invention to provide a method of overcoming a 
1 0 nutritional lack of folate, cobalamin or pyridoxine of a pregnant woman to prevent the 
development of the corresponding fetus developing a developmental disorder. 



Other objects and advantages will become apparent to those skilled in the art from a 
Hi review of the ensuing description. 

W These and other aspects of the present invention will be better appreciated by 

1 5 reference to the following drawings and Detailed Description. 



BRIEF DESCRIPTION OF THE DRAWINGS 



Figure 1 shows primers for PCR amplification of the dihydro folate reductase (DHFR) 
deletion polymorphism region. 



Figure 2 shows the genotypes of the DHFR 19 basepair deletion by non-denaturing 
20 polyacrylamide gel electrophoresis. Lanes 1 and 2 show genotypes 1,1. Lanes 3 and 4 
show genotypes 1, 2. Lanes 5 and 6 show genotypes 2,2. Lane 7 shows phiX174 RF 
DNA/Haelll size markers from BRL Life Technologies. 

Figure 3 shows the sequences of PCR amplification products in the Region of the 
DHFR polymorphism region. * is explained in Text, see Example 2. 
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Figure 4 A is a nucleotide sequence of the wild type human DHFR, (SEQ ID NO:41) 
from Yang et al., J. Mol. Biol 176:169-187 (1984), GeneBank accession no: X00855. 
The start codon is in bold. Figure 4B is the same nucleotide sequence as that of 
Figure 4A except the deletion of the 19 nucleotides due to the DHFR deletion 
5 polymorphism, (SEQ ID NO:42). 

DETAILED DESCRIPTION OF THE INVENTION 

The present invention in its broadest embodiment provides a method of diagnosing, 
preventing and/or treating specific physiological/developmental disorders. Such 
physiological/developmental disorders include schizophrenia, spina bifida cystica, 
Tourette's syndrome, bipolar illness, autism, conduct disorders, attention deficit 
disorder, obsessive compulsive disorder, chronic multiple tic syndrome and learning 
disorders such as dyslexia. 

A particular aspect of the present invention provides methodology for diagnosing, 
preventing and/or treating a developmental disorder such as schizophrenia. Such 
methodology is premised on the correlation between abnormalities in folate, 
cobalamin, and/or pyridoxine metabolism in an individual and/or the mother of an 
individual and the occurrence of the developmental disorder, e.g., schizophrenia in 
the individual. Further, the present invention provides a framework (i.e., the gene- 
teratogen model, and the DNA Polymorphism-Diet-Cofactor-Development both of 
which are described in detail below) which fully explain the rationale for the 
correlation, though the ultimate usefulness of the methods of the present invention are 
independent of any particular model. 

Within this context, the DNA Polymorphism-Diet-Cofactor-Development model 
maintains that a developmental disorder such as schizophrenia results in part from 
25 developmental brain damage sustained in utero due to maternal dietary deficiency of 
folate, pyridoxine or cobalamin potentiated by the aggregate effect of minor defects 
of folate, pyridoxine or cobalamin genes. The maternal damage to the fetus can result 
in part from insufficiency of the folate, pyridoxine and cobalamin themselves and/or 
from resulting effects such as immune deficiency and maternal teratogens, e.g. 
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hyperhomocysteinemia. Genes from either parent acting in the fetus may modify 
these damaging effects as exemplified in the gene-teratogen model, below. 

As described herein the present invention can be practiced on a case by case basis, or 
alternatively, it can be used in the screening of the general population, or within any 
5 particular subgroup, such as newborns (as is presently performed in the diagnosis and 
treatment of hyperphenylalaninemia). 

Therefore, if appearing herein, the following terms shall have the definitions set out 
below. 

As used herein a "gene involved in folate, pyridoxine, or cobalamin metabolism" is a 
10 gene that encodes a peptide or protein that plays a role in a pathway involved in either 
folate, pyridoxine, or cobalamin metabolism. An incomplete listing of examples of 
such proteins is given in Tables 2-7. 

As used herein the term "individual" includes a fetus, infant, child, adolescent, and 
adult. Therefore, as used herein, an individual originates at conception. 

15 As used herein an individual with a susceptibility for "having offspring that develop a 
developmental disorder" is meant to be indicative of the susceptibility of the 
offspring of that individual to develop the developmental disorder and is not in any 
way meant to be indicative of the susceptibility of the individual to have offspring. 

The term "proband" as used herein is operationally defined by Table 8 along with the 
20 accompanying explanatory information (see, Example 1). For most purposes, the 
proband can be considered the central figure in the familial analysis, the remaining 
individuals in the family being designated as "blood relatives". There are three types 
of probands: (1) an "affected proband" i.e., an individual that is believed to have a 
developmental disorder ; (2) a "control proband" an individual that is believed not to 
25 have a developmental disorder; and (3) a "diagnostic proband" i.e., an individual 
being diagnosed. 
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As used herein a "blood relative" of an individual is a relative that is related to the 
individual in a genetic sense. Blood relatives can include mothers, fathers, children, 
uncles, aunts, brothers, sisters, and grandparents. Preferably a blood relative is a 
parent, a sibling, or a grandparent. Adopted relatives, step-parents, relatives through 
5 marriage and the like are not blood relatives. Therefore, as used herein, the terms 
"mother", "father", "sibling", "grandparent", "grandfather" and "grandmother" are 
indicative of blood relationships. 



As used herein a "mate of an individual" is a person whose genetic material is 
combined with that of the individual for the conception of the offspring in question. 

10 As used herein the term "schizophrenia" describes a disorder that is at least partially 
due to one or more genetic mutations or polymorphisms in one or more genes 
involved in folate, cobalamin or pyridoxine metabolism in an individual that is 
schizophrenic and/or to one or more genetic mutations or polymorphisms in one or 
more genes involved in folate, cobalamin or pyridoxine metabolism in the mother of 

15 that individual. 

As used herein an individual is "schizophrenic" when the individual displays 
symptoms that would be accepted by an experienced psychiatrist to merit a diagnosis 
of schizophrenia. Such a diagnosis is based, at least in part, on the currently evolving 
guidelines for the diagnosis of schizophrenia which are listed in the successive 
20 editions of Diagnostic and Statistical Manual for Mental Disorders, put out by the 
American Psychiatric Association. The current edition is the DSM, Fourth Edition 
(1994). 

As used herein the terms "spina bifida cystica", "Tourette's syndrome", "bipolar 
illness", "autism", "conduct disorder", "attention deficit disorder", "obsessive 
25 compulsive disorder", "chronic multiple tic syndrome" and "learning disorders" such 
as "dyslexia"describe disorders which display symptoms that would be accepted by 
an experienced psychiatrist to merit a diagnosis of that disorder. Such a diagnosis is 
based, at least in part, on the currently evolving guidelines which are listed in the 
successive editions of Diagnostic and Statistical Manual for Mental Disorders, put out 
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by the American Psychiatric Association. The current edition is the DSM, Fourth 
Edition (1994). 

As used herein the term "teratogenic locus" indicates one or more alleles that act in a 
pregnant woman to cause an intrauterine teratogenic effect on the fetus. 

5 As used herein the terms "specificity locus" or "modifying locus" are used 
interchangeably and are indicative of one or more alleles that can act during 
pregnancy and/or after birth to prevent, modify, and/or ameliorate the teratogenic 
effect of the teratogenic locus. 

As used herein a "constellation of genetic mutations" is the set of genetic risk factor 
10 mutations that is present in a proband and relatives of the proband. One example of a 
constellation of genetic mutations is shown in a line of Table 8, below. 

As used herein a "risk factor" is a teratogen or substance (including a defective gene) 
that can lead to a teratogenic effect that is present or suspected of being present in a 
tissue sample or body fluid of an individual's mother during the individual's gestation 
1 5 and/or present or suspected of being present in a tissue sample or body fluid of the 
individual. 

As used herein a "genetic risk factor" is used interchangeably with the term "genetic 
explanatory variable" and is a genetic mutation and/or polymorphism that causes or 
potentially can cause the formation of and/or lead to the development of a risk factor 
20 in an individual or the individual's mother during gestation. 

As used herein an "environmental risk factor" is used interchangeably with the term 
"environmental explanatory variable" and is an environmental factor that causes or 
potentially can cause the formation of and/or lead to the development of a risk factor 
in an individual or the individual's mother during gestation. 
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As used herein an "explanatory variable" is either an "environmental explanatory 
variable" or a "genetic explanatory variable" or the variable defined by their 
interaction or any combination of the above. 

Enzymes whose deficiency may raise plasma homocysteine include 
5 methylenetetrahydrofolate reductase (MTHFR), methionine synthase, and folate 
receptors/transport proteins/binding proteins (as well as all of the proteins listed in 
Tables 2-7 below). 

The current (developmental) model for schizophrenia is that genetic and 
environmental factors cause brain damage in a fetus that later develops schizophrenia. 

10 However, the genetic and environmental factors have not been identified. Also, 
extensive linkage and association studies have failed to identify genes determining 
schizophrenia. The reasons usually given for this difficulty include: (i) locus 
heterogeneity, i.e., more than one gene locus is involved, perhaps many gene loci 
each with a small effect; (ii) the mode of inheritance of schizophrenia is unknown; 

15 and (iii) an additional possible factor is that the frequency of the disease alleles may 
be high, thus greatly reducing the power of linkage studies. 

The DNA Polymorphism-Diet-Cofactor-Development model explains all of these 
difficulties and at the same time proposes a unified metabolic abnormality. The 
unified metabolic abnormality is: (a) ENVIRONMENTAL, i.e., due to a 

20 folate/cobalamin/pyridoxine deficiency caused by either decreased ingestion or 
increased requirement during pregnancy; (b) GENETIC, i.e., due to a 
folate/cobalamin/pyridoxine genetic defect caused by the aggregate effect of multiple 
mutations of folate/cobalamin/pyridoxine genes each individually having a small 
effect; and (c) the interaction of the folate/cobalamin/pyridoxine environmental and 

25 genetic factors (indicated above) to cause other harmful effects such as maternal 
teratogens and immune deficiency during gestational development. Different gene 
loci and different combinations of gene loci will be involved in different patients and 
different families. The problem of locus heterogeneity is addressed by the hypothesis 
that the folate/cobalamin/pyridoxine genetic defect is the aggregate effect of multiple 
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mutations of folate/cobalamin/pyridoxine genes each of which have a relatively small 
effect. 

The problem of mode of inheritance is addressed by the gene-teratogen model. The 
gene-teratogen model describes the special features of genes acting in utero; both 
5 teratogenic and modifying of specificity loci may be involved. If these effects are not 
taken into account, the assignment of affection status in schizophrenia pedigrees is 
inaccurate. Assignment of affection status is a key element in defining the mode of 
inheritance for all kinds of linkage mapping. Failure to assign the correct mode of 
yp inheritance is another factor that has made the linkage studies very difficult. 

ill 

^ 1 0 Finally, the DNA Polymorphism-Diet-Cofactor-Development model proposes that 

01 some of the genetic factors for schizophrenia are common in the population. In fact, 

j = subclinical deficiency of folate, pyridoxine, and cobalamin is common in the 

p population and common among pregnant women as well. Pregnancy further 

ry increases the requirement for folate, pyridoxine, and cobalamin. Common genetic 

^ 15 polymorphisms of folate and cobalamin genes are also known, some of them 

C3 functional. Common genetic risk factors tend to be functional polymorphisms and/or 

mutant alleles that individually have small effects. Otherwise, they would be largely 
eliminated from the population by natural selection and would not be common. High 
disease allele frequency is yet another factor that greatly diminishes the power of a 
20 linkage study. 

Besides explaining the difficulties with current linkage studies, the DNA 
Polymorphism-Diet-Cofactor-Development model explains all of the unusual 
biological and epidemiological features of schizophrenia: e.g. the decreased amount 
of gray matter in brain areas, the unusual birth-month effect, the geographical 

25 differences in incidence, the socioeconomic predilection, the association with 

obstetrical abnormalities (low birth weight and prematurity), and the association with 
famine and viral epidemics. Consistently, genetic linkage and cytogenetic studies in 
schizophrenia have implicated various chromosome regions, some of them containing 
folate, pyridoxine, and cobalamin genes including dihydro folate reductase, 

30 thymidylate synthase, and transcobalamin II. The DNA 
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Polymorphism-Diet-Cofactor-Development model predicts that folate, pyridoxine, or 
cobalamin gene mutations have a high frequency in schizophrenia patients or family 
members. Furthermore, mothers of schizophrenics are predicted to be particularly 
susceptible to producing one or more teratogens during pregnancy. 
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5 The present invention therefore provides methods for: (a) Diagnostic testing of 
schizophrenia by identifying a folate, pyridoxine, or cobalamin gene mutation or 
constellation of mutations in the patient, mother, and father, (b) Prevention of 
schizophrenia by diagnostic testing in families already affected by schizophrenia or 
by diagnostic population screening for folate mutations and identifying couples at risk 

10 for producing schizophrenic offspring. These pregnancies can be further monitored 
for risk factors, e.g. dietary folate/pyridoxine/cobalamin, plasma 
folate/pyridoxine/cobalamin, or red blood cell folate; plasma homocysteine or other 
teratogens, (c) Therapy for schizophrenia, e.g., treating the pregnant mother with 
folate, pyridoxine, cobalamin or other agents. The treatment can be monitored at 

1 5 regular intervals to determine the effect of therapy, (d) Presymptomatic treatment of 
schizophrenia on young children found to be susceptible to schizophrenia by 
diagnostic testing for folate gene mutations and other risk factors can also be treated 
with methylfolate or related therapeutic modalities to forestall the appearance of 
schizophrenia symptoms in adolescence or adulthood. 



20 Empirical studies with methylfolate treatment of schizophrenia have shown modest 
clinical improvement. The DNA Polymorphism-Diet-Cofactor-Development model 
gives a rationale for such therapy as well as for intensive testing of related therapeutic 
modalities. Genetic testing will need to be carried out in such patients to gauge their 
likelihood of responding to therapy. In addition, the DNA 

25 Polymorphism-Diet-Cofactor-Development model gives direction and impetus 
toward uncovering the mechanism of fetal brain damage leading to schizophrenia. 



Diagnostic testing for schizophrenia can involve testing not just the patient, but 
mother and father as well, for not just one factor but multiple genetic factors. For 
example, data for two gene loci (both folate-related genes) were used in Example 2. 
30 In this case, there were only four explanatory variables for each comparison. 




25 



In addition, risk factors appearing only during pregnancy may play a role, e.g. dietary 
folate which can be further monitored during the pregnancy. In certain instances, 
genotype data can be used as the sole explanatory variables, particularly in the case 
when no environmental explanatory variables are known. In such a case, the 
5 predicted probabilities will be only for the genetic component of the proband's risk of 
schizophrenia. In addition, schizophrenia mothers, fathers, and sibs do not 
necessarily have to come from the same families as the schizophrenia probands, as 
described in Example 2. 

Of course certain genetic factors will turn out to be more common than others. This 
10 may simplify testing somewhat. Also some genetic factors may operate chiefly in the 
mother, while others will operate chiefly in the schizophrenic patient. This may also 
simplify testing. There are some approaches to assessing risk factors during a past 
pregnancy, e.g. current dietary history as an indicator of past diet, methionine loading 
as in indicator of how susceptible a mother is to raising her plasma homocysteine, 
15 assessment of other risk factors besides folate metabolism that may affect pregnancy 
outcome. Procedures including all of these variables are both envisioned and 
included in the present invention. 

Thus the present invention provides a method of diagnosis of schizophrenia. In one 
aspect of the invention, diagnostic testing for genetic susceptibility to schizophrenia 

20 determines the probability that the proband is affected with schizophrenia due to 

genetic factors. This is carried out by genetic testing of a patient suspected of having 
schizophrenia and/or whatever informative relatives are available, e.g. mother, father, 
sibs, or children. The genotypes of certain folate and/or cobalamin and/or pyridoxine 
gene mutations or constellation of mutations (folate and/or cobalamin and/or 

25 pyridoxine gene mutations) are determined for each individual. 

Since the abnormal phenotype of schizophrenia can be determined by both genetic 
and environmental factors and since other genetic factors besides 
folate/cobalamin/pyridoxine gene mutations may be involved, the presence of 
folate/cobalamin/pyridoxine gene mutations may be neither necessary nor sufficient 
30 to cause schizophrenia. Thus, an unaffected individual may have the same genetic 
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risk factors as an affected individual but may lack sufficient environmental factors to 
cause the abnormal clinical disease. Also, an affected individual may lack 
folate/cobalamin/pyridoxine gene mutations but may have other related or non-related 
genetic risk factors that caused the schizophrenia. 

5 Therefore folate/cobalamin/pyridoxine gene mutations are used as explanatory 

variables (genetic risk factors) to calculate the predicted probability that an individual 
has genetic susceptibility to schizophrenia due to these mutations. Genetic variation 
can be expected to account for approximately about half of the risk of developing 
schizophrenia since the concordance rate in identical twins has been estimated to be 

10 about 50%. The other half of the risk results from environmental factors due to their 
different positions in the uterus and to differences in the blood supply. The use of 
environmental factors as additional explanatory variables enhances this probability 
calculation, although this environmental data is more difficult to gather. Together, 
using both genetic and environmental explanatory variables, the predicted probability 

15 that an individual is schizophrenic may approach 1.0. 

One likely situation for the use of the present methodology is in the diagnosis of a 
patient that has developed a psychosis. In such a case, the clinician is likely to be 
interested in determining the probability that this individual has schizophrenia. The 
number of blood relatives (preferably first degree relatives) of the patient-to-be 

20 diagnosed, both unaffected and affected, could then be determined. The number of 
these who would contribute a blood sample for analysis, for example, could then be 
ascertained. It is preferable that the patient-to-be-diagnosed also contributes a blood 
sample, however in certain situations, this may not be an option. The availability of 
dietary and epidemiological information for environmental explanatory variables, 

25 especially from the patient and the mother, can also ascertained. Of course all 

relevant legal and ethical rules should be followed regarding informed consent for the 
genetic testing. 
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Biological samples such as tissue or fluid samples (e.g., 7 ml of blood in an 
EDTA-containing vacutainer, see Example 2, below), and obtainable environmental 
data from the patient and family members are then collected. DNA is extracted from 



27 



the sample and genotypes for alleles of folate and/or cobalamin and/or pyridoxine 
genes are determined. The methods for genotyping depend upon the specific genetic 
markers used as explanatory variables. The methods for allele determination for two 
genetic markers are discussed in the Examples below. 



5 Data of the genetic and environmental explanatory variables for the 

patient-to-be-diagnosed (proband) and participating family members are added to a 
reference data set preferably consisting of well-defined schizophrenia probands and 
family members, and control probands, and family members for whom data is 
available for many explanatory variables. As an approximation the control probands 
10 themselves also can be used as the controls for each proband family member class as 
SJ shown in Example 2, below. Thus, as an approximation the control probands can be 

is i used as controls for the affected probands; and/or separately for the mothers of 

affected probands; and/or separately for the fathers of affected probands, etc. Another 
s example of a use of the control probands is in the evaluation and/or analysis of a 

^ 15 particular diagnostic proband. In this case, the approximation is obtained by adding 

Hi the diagnostic proband to the group of affected probands and control probands. 



A model is then created consisting of the explanatory variables actually available 
from specific patient-to-be diagnosed and family members participating in the testing. 
This new combined data set (reference data set and data from patient-to-be-diagnosed 
20 with participating family members) is analyzed by binary logistic regression (e.g., 
using a statistical software package such as the SAS System embodied in Example 1 
below, though other programs may be used) for the model chosen giving the 
predicted probability that a proband is affected with schizophrenia for all of the 
probands including the patient-to-be-diagnosed. 



25 In a particular embodiment the model is modified and the goodness of fit for the 
patient-to-be-diagnosed is checked. The predicted probability that the 
patient-to-be-diagnosed has schizophrenia is compared with a classification table 
generated from the model used to determine the likelihood of false positives and false 
negatives. 
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The predicted probability that the patient-to-be-diagnosed is affected with 
schizophrenia, with the likelihood of false positive or false negative result, can then 
be forwarded to the clinician. 

The methods for determining an individual's risk for developing schizophrenia taught 
5 by the present invention can be used in a variety of settings. For example, the present 
invention also provides a therapy for schizophrenia. Empirical studies with 
methylfolate treatment of schizophrenia have shown modest clinical improvement. 
The DNA Polymorphism-Diet-Cofactor-Development model provides a rationale for 
such therapy as well as for intensive testing of related therapeutic modalities, e.g. 
10 other cofactors such as cobalamin or pyridoxine. In addition, the DNA 

Polymorphism-Diet-Cofactor-Development model gives direction and impetus 
toward uncovering the mechanism of fetal brain damage leading to schizophrenia. Of 
course such therapy also can be provided on a case by case basis in order to gauge the 
likelihood of the patient of responding to such therapy, with the methodology for 



fU 15 diagnosis of the present invention enabling the skilled practitioner to assess that 

us 

ri likelihood. 



In addition, the present invention provides a method of identifying individuals that 
are likely to be aided by presymptomatic treatment for schizophrenia. For example, 
young children found to have a high risk for susceptibility to schizophrenia by 
20 diagnostic testing can be treated with methylfolate or related therapeutic modalities to 
forestall the appearance of schizophrenia symptoms in adolescence or adulthood. 
The present invention further provides methodology for diagnostic testing for specific 
families already affected by schizophrenia. 

The present invention further provides methodology for population screening for 
25 folate/cobalamin/pyridoxine mutations to help identify couples at risk for producing 
schizophrenic offspring. Subsequent or concurrent pregnancies can then be 
monitored for environmental risk factors, and treated with folate, cobalamin, 
pyridoxine or other agents and monitored at intervals for the effect of therapy. Such 
monitoring can include measuring levels of folate, cobalamin, pyridoxine or 
30 homocysteine in a particular tissue and/or fluid sample, such as blood. 
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Since schizophrenia is a developmental disorder, it is likely that these same risk 
factors discussed here for schizophrenia could play a role in other developmental 
disorders including spina bifida cystica, Tourette's syndrome, learning disorders 
5 including dyslexia, conduct disorder, attention-deficit hyperactivity disorder, bipolar 
illness, autism, and obsessive-compulsive disorder. Interestingly, the mode of 
inheritance of these disorders, like that of schizophrenia, has been difficult to 
determine despite the fact that a genetic component to the etiology of each has been 
documented. Therefore, methodology analogous to that exemplified herein for 
10 schizophrenia can be readily adapted for diagnosing and/or treating other such 
developmental disorders. 

Nucleic Acids 

In accordance with the present invention there may be employed conventional 
molecular biology, microbiology, and recombinant DNA techniques within the skill 

15 of the art. Such techniques are explained fully in the literature. See, e.g., Sambrook, 
Fritsch & Maniatis, Molecular Cloning: A Laboratory Manual, Second Edition 
(1989) Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New York (herein 
"Sambrook et al, 1989"); DNA Cloning: A Practical Approach, Volumes I and II 
(D.N. Glover ed. 1985); Oligonucleotide Synthesis (M.J. Gait ed. 1984); Nucleic Acid 

20 Hybridization [B.D. Hames & SJ. Higgins eds. (1985)]; Transcription And 

Translation [B.D. Hames & SJ. Higgins, eds. (1984)]; Animal Cell Culture [R.I. 
Freshney, ed. (1986)]; Immobilized Cells And Enzymes [IRL Press, (1986)]; 
B. Perbal, A Practical Guide To Molecular Cloning (1984); F.M. Ausubel et al 
(eds.), Current Protocols in Molecular Biology, John Wiley & Sons, Inc. (1994)]. 

25 A "nucleic acid molecule" refers to the phosphate ester polymeric form of 

ribonucleosides (adenosine, guanosine, uridine or cytidine; "RNA molecules") or 
deoxyribonucleosides (deoxyadenosine, deoxyguanosine, deoxythymidine, or 
deoxycytidine; "DNA molecules"), or any phosphoester analogs thereof, such as 
phosphorothioates and thioesters, in either single stranded form, or a double-stranded 



30 

helix. Double stranded DNA-DNA, DNA-RNA and RNA-RNA helices are possible. 
The term nucleic acid molecule, and in particular DNA or RNA molecule, refers only 
to the primary and secondary structure of the molecule, and does not limit it to any 
particular tertiary forms. Thus, this term includes double-stranded DNA found, inter 
5 alia, in linear or circular DNA molecules including restriction fragments, plasmids, 
and chromosomes. In discussing the structure of particular double-stranded DNA 
molecules, sequences may be described herein according to the normal convention of 
giving only the sequence in the 5' to 3' direction along the nontranscribed strand of 
DNA {i.e., the strand having a sequence homologous to the mRNA). A "recombinant 
10 DNA molecule" is a DNA molecule that has undergone a molecular biological 
manipulation. 

A nucleic acid molecule is "hybridizable" to another nucleic acid molecule, such as a 
cDNA, genomic DNA, or RNA, when a single stranded form of the nucleic acid 
molecule can anneal to the other nucleic acid molecule under the appropriate 

15 conditions of temperature and solution ionic strength {see Sambrook et aL, supra). 
The conditions of temperature and ionic strength determine the "stringency" of the 
hybridization. High stringency hybridization conditions correspond to 50% 
formamide, 5x or 6x SSC. Hybridization requires that the two nucleic acids contain 
complementary sequences, although depending on the stringency of the hybridization, 

20 mismatches between bases are possible. The appropriate stringency for hybridizing 
nucleic acids depends on the length of the nucleic acids, the GC percentage, and the 
degree of complementation, variables well known in the art. The greater the degree 
of similarity or homology between two nucleotide sequences, the greater the value of 
T m for hybrids of nucleic acids having those sequences. The relative stability 

25 (corresponding to higher T m ) of nucleic acid hybridizations decreases in the following 
order: RNA: RNA, DNA:RNA, DNA:DNA. For hybrids of greater than 100 
nucleotides in length, equations for calculating T m have been derived {see Sambrook 
et al , supra, 9.50-10.5 1). For hybridization with shorter nucleic acids, i.e., 
oligonucleotides, the position of mismatches becomes more important, and the length 

30 of the oligonucleotide determines its specificity {see Sambrook et al, supra, 1 1 .7- 

1 1.8). Preferably a minimum length for a hybridizable nucleic acid (e.g., a nucleotide 
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probe or primer such as a PCR or RT-PCR primer) is at least about 12 nucleotides; 
preferably at least about 18 nucleotides; and more preferably the length is at least 
about 27 nucleotides; and most preferably at least about 36 nucleotides. Specific 
probes and primers that can be used to distinguish specific variants of the nucleic 
5 acids encoding the proteins involved in folate, pyridoxine, and/or cobalamin 
metabolism are also part of the present invention. 

Such nucleotide probes and primers can be labeled or used to label complementary 
DNA (where appropriate) by any number of ways well known in the art including 
using a radioactive label, such as 3 H, 14 C, 32 P, or 35 S, a fluorescent label, a boron label 
[U.S. Patent No: 5,595,878, Issued January 21, 1997 and U.S. Patent No: 5,876,938, 
Issued March 2, 1999 which are incorporated by reference in their entireties], and 
enzymatic tags such as urease, alkaline phosphatase or peroxidase. In the case of 
enzyme tags, colorimetric indicator substrates are known which can be employed to 
provide a means visible to the human eye or spectrophotometrically, to identify 
specific hybridization with complementary nucleic acid-containing samples. 

In a specific embodiment, the term "standard hybridization conditions" refers to a T m 
of 55 °C, and utilizes conditions as set forth above e.g., 5X SSC. In a preferred 
embodiment, the T m is 60 °C; in a more preferred embodiment, the T m is 65 °C. 

A DNA "coding sequence" is a double-stranded DNA sequence which is transcribed 
20 and translated into a polypeptide in a cell in vitro or in vivo when placed under the 
control of appropriate regulatory sequences. The boundaries of the coding sequence 
are determined by a start codon at the 5' (amino) terminus and a translation stop 
codon at the 3 ' (carboxyl) terminus. A coding sequence can include, but is not 
limited to, prokaryotic sequences, cDNA from eukaryotic mRNA, genomic DNA 
25 sequences from eukaryotic {e.g., mammalian) DNA, and even synthetic DNA 

sequences. If the coding sequence is intended for expression in a eukaryotic cell, a 




10 



15 
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polyadenylation signal and transcription termination sequence will usually be located 
3' to the coding sequence. 

"Transcriptional and translational control sequences" are DNA regulatory sequences, 
such as promoters, enhancers, terminators, and the like, that provide for the 
5 expression of a coding sequence in a host cell. In eukaryotic cells, polyadenylation 
signals are control sequences. 

A "promoter sequence" is a DNA regulatory region capable of binding RNA 
polymerase and initiating transcription of a downstream (3 ' direction) coding 
tJi sequence. For purposes of defining the present invention, the promoter sequence is 

Si 1 0 bounded at its 3 ' terminus by the transcription initiation site and extends upstream (5 ' 

ni = ... 

«; direction) to include the minimum number of bases or elements necessary to initiate 

P transcription at levels detectable above background. Within the promoter sequence 

£3 will be found a transcription initiation site (conveniently defined for example, by 

?*; mapping with nuclease SI), as well as protein binding domains (consensus 

y j 1 5 sequences) responsible for the binding of RNA polymerase. 

A "signal sequence" is included at the beginning of the coding sequence of a protein 
to direct the protein to a particular site/compartment in the cell such as the surface of 
a cell. This sequence encodes a signal peptide, N-terminal to the mature polypeptide, 
that directs the host cell to translocate the polypeptide. The term "translocation signal 
20 sequence" is used herein to refer to this sort of signal sequence. Translocation signal 
sequences can be found associated with a variety of proteins native to eukaryotes and 
prokaryotes, and are often functional in both types of organisms. 

Identification of Genetic Mutations 



25 



A biological sample can be obtained from an individual and/or a blood relative of the 
individual, and from appropriate controls, using a sample from any body component 
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including tissue punches, body fluids, and hair, as long as the biological sample 
contains nucleic acids and/or proteins/pep tides. Thus the DNA, mRNA, proteins or 
peptides of the biological sample can be used to identify mutations and/or variants in 
genes involved in folate, pyridoxine, or cobalamine metabolism. The present 
5 invention therefore includes methods of detecting and quantifying these nucleic acids 
and/or proteins/peptides that can be used to identify genetic risk factors. 

In a particular embodiment the DNA is extractable. A particularly useful source of 
DNA is blood. For example, 2.5- 40 mis of blood can be collected in a vacutainer 
containing EDTA. The blood sample is placed on ice and then centrifuged to separate 
10 plasma, red cells, and buffy coat. The separated fractions are then frozen at -80°C. 

The DNA can be isolated from the buffy coat by a number of procedures well known 
in the art including using a QIAmp column DNA extraction procedure or the 
QIAGEN Genomic-tip method. The isolated DNA can be digested with a series of 
restriction enzymes, for example, and then the digested products can be hybridized 
15 with one or more particular nucleic acid probes designed from a particular gene to 
identify the gene and preferably to test for particular genetic mutations. 

Preferably the genomic DNA can be amplified by PCR using appropriate primer pairs 
such as the primer pairs for the MTHFR or DHFR genes which were used in the 
Example below. The PCR amplified product can be sequenced directly, or 

20 alternatively be digested with one or more appropriate restriction enzymes. The 
resulting digested products can be separated e.g., by column chromatography, or 
preferably by polyacrylamide or agarose gel electrophoresis. The isolated digestion 
products can be compared e.g., by previously determined restriction maps, and/or 
alternatively, the digestion products can be sequenced directly. Alternatively, as in 

25 the case of DHFR, genetic polymorphisms can be detected through the use of 
restriction enzymes. 
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Although a restriction map of a gene is sufficient for the employment of the methods 
disclosed herein, in preferred embodiments the nucleotide sequences of the genes 
used in the testing steps are known. To this end a large sampling of such sequences 
are provided in Tables 2-7. (These sequences may also be used in the design of 
restriction maps.) Thus, initially each gene whether used separately or used in a 
constellation of genes is characterized by the sequencing of the wild type gene, 
preferably including the coding regions, introns, control sequences, and other non- 
coding regions. In addition, mutations of such genes found in the general population 
can also be characterized. With the recent advances in the sequencing of the human 
genome the present invention contemplates that additional sequence information will 
become publicly available, particularly with regard to mutations in relevant introns, 
and control sequences etc. which are not available in cDNA libraries. Such sequence 
information is fully envisioned to be incorporated into the on-going compilations of 
relevant DNA sequence databases of the present invention, as well as for its parallel 
use in the general methodology described herein. Thus DNA or mRNA or cDNA 
made from the mRNA can be used to identify mutations and/or variants in genes 
involved in folate, pyridoxine, or cobalamine metabolism. 

There are many methods currently known in the art to identify variant/mutant DNA, 
all of which may be\used in the present invention (see e.g. , internet address 
http://www.ich. bpmf.ac\jk/cmgs/mutdet.htm). Such methods include but in no way are 
limited to direct sequencing, array sequencing, matrix-assisted laser 
desorption/ionization time-of- flight mass spectrometry (Malditof) [Fitzgerald et al. , 
Ann. Rev. Biophy. BiomolSStruct. 24:117-140 (1995)], Polymerase Chain Reaction 
"PCR" reverse-transcriptasfe Polymerase Chain Reaction "RT-PCR", RNAase 
protection assays, Array quantitation e.g., as commercially provided by Affymetrix, 
Ligase Chain Reaction or Ligase Amplification Reaction (LCR or LAR), 
Self-Sustained Synthetic Reactiofi (3SR/NASBA), Restriction Fragment Length 
Polymorphism (RFLP),Cycling Pr0\be Reaction (CPR), Single-Strand Conformation 
Polymorphism (SSCP), heteroduplexsanalysis, hybridization mismatch using 
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nucleases (e.g., cleavase)/\Southern, Northerns, Westerns, South Westerns, ASOs, 
Molecular beacons, footprinting, and Fluorescent In Situ Hybridization (FISH). Some 
of these methods are briefly described below. 

PCR is a method for increasing the concentration of a segment of target sequence in a 
5 mixture of genomic DNA without cloning or purification. PCR can be used to 
directly increase the concentration of the target to an easily detectable level. This 
process for amplifying the target sequence involves introducing a molar excess of two 
oligonucleotide primers which are complementary to their respective strands of the 
double-stranded target sequence to the DNA mixture containing the desired target 

10 sequence. The mixture is denatured and then allowed to hybridize. Following 

hybridization, the primers are extended with polymerase so as to form complementary 
strands. The steps of denaturation, hybridization, and polymerase extension can be 
repeated in order to obtain relatively high concentrations of a segment of the desired 
target sequence. The length of the segment of the desired target sequence is 

15 determined by the relative positions of the primers with respect to each other, and, 
therefore, this length is a controllable parameter. Because the desired segments of the 
target sequence become the dominant sequences (in terms of concentration) in the 
mixture, they are said to be "PCR-amplified." [Mullis (U.S. Patent No. 4,683,195) 
and Mullis et al. (U.S. Patent No. 4,683,202)] 



20 In Ligase Chain Reaction or Ligase Amplification Reaction (LCR or LAR) four 
oligonucleotides, two adjacent oligonucleotides which uniquely hybridize to one 
strand of target DNA, and a complementary set of adjacent oligonucleotides, which 
hybridize to the opposite strand are mixed and DNA ligase is added to the mixture. 
Provided that there is complete complementarity at the junction, ligase will covalently 

25 link each set of hybridized molecules. Importantly, in LCR, two probes are ligated 
together only when they base-pair with sequences in the target sample, without gaps 
or mismatches. Repeated cycles of denaturation, hybridization and ligation amplify a 
short segment of DNA. [Barany, Proc. Natl. Acad. Sci., 88:189 (1991); Barany, PCR 
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Methods and Applic., 1:5 (1991); and Wu and Wallace, Genomics 4:560 (1989)] 
LCR has also been used in combination with PCR to achieve enhanced detection of 
single-base changes. Segev, PCT Public. No. W09001069 Al (1990). 

Self-Sustained Synthetic Reaction (3SR/NASBA) is a transcription-based in vitro 
5 amplification system [Guatelli et al, Proc. Natl Acad. Sci., 87:1874-1878, 7797 
(1990); Kwok et al,Proc. Natl Acad. Sci., 86: 1 173-1 177) that can exponentially 
amplify RNA sequences at a uniform temperature. The amplified RNA can then be 
utilized for mutation detection (Fahy et al, PCR Meth. AppL, 1:25-33 (1991). In this 
y method, an oligonucleotide primer is used to add a phage RNA polymerase promoter 

Ul 10 to the 5' end of the sequence of interest. In a cocktail of enzymes and substrates that 

%i includes a second primer, reverse transcriptase, RNase H, RNA polymerase and 

Iji ribo-and deoxyribonucleoside triphosphates, the target sequence undergoes repeated 

01 rounds of transcription, cDNA synthesis and second-strand synthesis to amplify the 

pj area of interest. 

y j 15 RFLP can be used to detect DNA polymorphisms arising from DNA sequence 

p% variation. This method consists of digesting DNA with one or more restriction 

endonucleases {e.g., EcoRI) and analyzing the resulting fragments by means of 
Southern blots [Southern, E., Methods in Enzymology, 69:152 (1980)], as further 
described by Botstein, et al., Am. J. Hum. Genet., 32:314-331 (1980) and White, et 
20 al., Sci. Am., 258:40-48 (1988). Since a DNA polymorphism may create or delete a 
restriction site, the length of the corresponding restriction fragment with any given 
restriction enzyme could change. Once a difference in a restriction fragment length is 
identified it can be used to readily distinguish a particular polymorphism from the 
wild type DNA. Mutations that affect the recognition sequence of the endonuclease 
25 will preclude enzymatic cleavage at that site, thereby altering the cleavage pattern of 
that DNA. DNAs are compared by looking for differences in restriction fragment 
lengths. A technique for detecting specific mutations in any segment of DNA is 
described in Wallace, et al,[Nucl Acids Res., 9:879-894 (1981)]. It involves 
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hybridizing the DNA to be analyzed (target DNA) with a complementary, labeled 
oligonucleotide probe. Due to the thermal instability of DNA duplexes containing 
even a single base pair mismatch, differential melting temperature can be used to 
distinguish target DNAs that are perfectly complementary to the probe from target 
5 DNAs that differ by as little as a single nucleotide. In a related technique, described in 
Landegren, et ai, Science, 41:1077-1080 (1988), oligonucleotide probes are 
constructed in pairs such that their junction corresponds to the site on the DNA being 
analyzed for mutation. These oligonucleotides are then hybridized to the DNA being 
analyzed. Base pair mismatch between either oligonucleotide and the target DNA at 

y9 1 0 the junction location prevents the efficient joining of the two oligonucleotide probes 

*j byDNAligase. 

gi When a sufficient amount of a nucleic acid to be detected is available, there are 

j 6 advantages to detecting that sequence directly, instead of making more copies of that 

Q target, (e.g., as in PCR and LCR). Most notably, a method that does not amplify the 

U1 

ry 15 signal exponentially is more amenable to quantitative analysis. Even if the signal is 

%l enhanced by attaching multiple dyes to a single oligonucleotide, the correlation 

p between the final signal intensity and amount of target is direct. Such a system has an 

additional advantage that the products of the reaction will not themselves promote 
further reaction, so contamination of lab surfaces by the products is not as much of a 
20 concern. Traditional methods of direct detection including Northern and Southern 
blotting and RNase protection assays usually require the use of radioactivity and are 
not amenable to automation. Recently devised techniques have sought to eliminate 
the use of radioactivity and/or improve the sensitivity in automatable formats. 



One such example is the Cycling Probe Reaction (CPR) [Duck et aL, BioTech., 9:142 
25 (1990)]. CPR , uses a long chimeric oligonucleotide in which a central portion is 

made of RNA while the two termini are made of DNA. Hybridization of the probe to 
a target DNA and exposure to a thermostable RNase H causes the RNA portion to be 
digested. This destabilizes the remaining DNA portions of the duplex, releasing the 
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remainder of the probe from the target DNA and allowing another probe molecule to 
repeat the process. The signal, in the form of cleaved probe molecules, accumulates 
at a linear rate. While the repeating process increases the signal, the RNA portion of 
the oligonucleotide is vulnerable to RNases that may carried through sample 
5 preparation. 

Single-Strand Conformation Polymorphism (SSCP) is based on the observation that 
single strands of nucleic acid can take on characteristic conformations in 
non-denaturing conditions, and these conformations influence electrophoretic 
mobility. [Hayashi, PCR Meth. Appl., 1:34-38, (1991). The complementary strands 
assume sufficiently different structures that one strand may be resolved from the 
other. Changes in sequences within the fragment will also change the conformation, 
consequently altering the mobility and allowing this to be used as an assay for 
sequence variations (Orita, et aL 9 Genomics 5:874-879, (1989). The SSCP process 
involves denaturing a DNA segment (e.g., a PCR product) that is labeled on both 
strands, followed by slow electrophoretic separation on a non-denaturing 
polyacrylamide gel, so that intra-molecular interactions can form and not be disturbed 
during the run. This technique is extremely sensitive to variations in gel composition 
and temperature. 

In Fluorescent In Situ Hybridization (FISH), specific probes are designed which can 
20 readily distinguish the wild-type gene from the variant/mutant gene. Such 

methodology allows the identification of a variant/mutant gene through in situ 
hybridization (U.S. Patent No. 5,028,525, Issued July 2, 1991; U.S. Patent No. 
5,225,326, Issued July 6, 1993; and U.S. Patent No. 5,501,952, Issued March 26, 
1996. FISH does not require the extraction of DNA. In addition, procedures for 
25 separating fetal blood cells from maternal blood cells are well known in the art 

allowing the fetus and the mother to be analyzed from the same body fluid sample 
(see U.S. Patent No: 5,629,147, Issued May 13, 1997). 



y.1 
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Similarly, antibodies raised against specific mutations and/or variants in the gene 
products of the genes involved in folate, pyridoxine, or cobalamine metabolism can 
be used to identify specific polymorphisms. Alternatively, antibodies raised against 
the wild type proteins can be used to detect and/or quantify the amount of wild type 
5 protein present in a given biological sample. In the case in which cross-reacting 
protein isn't synthesized by the cells of an individual, or is synthesized in 
significantly lower amounts than those of control subjects, such determinations can be 
used to identify a genetic risk factor. In addition, these antibodies can be used in 
methods well known in the art relating to the localization and activity of the gene 
10 products, e.g., for Western blotting, imaging the proteins in situ, measuring levels 
W| thereof in appropriate physiological samples, etc. using any of the detection 

Sj techniques known in the art. Furthermore, such antibodies can be used in flow 

cytometry studies, in immunohistochemical staining, and in immunoprecipitation 
0 1 which serves to aid the determination of the level of expression of a protein in the cell 

p 15 or tissue. 

Ml 

yj In the particular instance when the gene product is an enzyme, e.g., dihydro folate 

j*3 reductase, the enzymatic activity of a biological sample can be indicative of the 

presence of a genetic risk factor. In a particular embodiment, a decrease in an enzyme 

activity that is associated with folate, pyridoxine, or cobalamine metabolism can be 
20 indicative of the presence of the genetic risk factor. Such assays can be performed on 

multiple samples such as on a microplate reader [Widemann et al, Clin Chem. 

45:223-228 (1999)]. 



MODEL 1 

The Gene-Teratogen Model for the Inheritance Pattern of Certain Developmental 
25 Disorders 

Introduction : 

It has long been known, e.g. from extensive studies of exogenous teratogens in inbred 
mice [Finnell and Chernoff, Gene-teratagen interactions: an approach to 
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understanding the metabolic basis of birth defects, In Pharmacokinetics in 
Teratogenesis,Vol. 11:97-109 Experimental Aspects In Vivo and In Vitro, CRC Press, 
Inc, Boca Ratan, Fl. (1987)], that teratogens may be influenced by genetic factors. It 
is less well known that the same gene defect may cause different clinical disorders 
5 depending upon whether the metabolic effect of the gene defect is exerted during 
gestation in utero or during postnatal life. However, the consequences of 
gene-teratogen interactions in human pedigrees have not been extensively explored, 
especially the consequences for the use of linkage mapping to identify an unknown 
gene acting in utero to cause a developmental disorder. A number of common human 

10 developmental disorders have been shown to have a genetic component to their 

etiology. However, for certain developmental disorders, the mode of inheritance has 
been difficult to determine and linkage studies have met with unexpected difficulties 
or have achieved limited success. These developmental disorders include spina bifida 
cystica [Chatkupt, Am J Med Genet, 44:508-512 (1992)], Tourette's syndrome & 

15 related disorders, e.g. obsessive-compulsive disorder and chronic multiple tics 
syndrome [Pauls, Adv Neurol, 58:151-157 (1992); McMahon etal, Adv Neurol, 
58:159-165 (1992); Heutink etal, Am J Hum Genet, 57:465-473 (1995); Grice etal, 
Am J Hum Genet, 59:644-652 (1996)], learning disorders, including dyslexia [Lewis, 
et al, Behav Genet, 23:291-297 (1993); Pennington, J Child Neurol 10 Suppl, 1:S69- 

20 S77 (1995)], conduct disorder [Lombroso et al, J Am Acad Child Adolesc Psychiatry, 
33:921-938 (1994)], attention-deficit hyperactivity disorder [Lombroso et al, J Am 
Acad Child Adolesc Psychiatry, 33:921-938 (1994)], bipolar illness [Baron, Acta 
Psychiatr Scand, 92:81-86 (1995); Benjamin and Gershon, Biol Psychiatry, 40:313- 
316 (1996); Risch and Botstein, Nature Genet, 12:351-353 (1996); Jamison and 

25 Mclnnis, Nature Med, 2:521-522 (1996); Morell, Science, 272:31-32 (1996)], 

schizophrenia [Owen, Psychol Med, 22:289-293 (1992); Cloninger, Am J Med Genet, 
54:83-92(1994); Lander and Kruglyak, Nature Genet, 11:241-247 (1995); Baron, 
Acta Psychiatr Scand, 92:81-86 (1995); Benjamin and Gershon, Biol Psychiatry, 
40:313-316 (1996); Baron, Am J Med Genet, 67:121-123 (1996)], autism [Lombroso 

30 et al, J Am Acad Child Adolesc Psychiatry, 33:921-938 (1994)], and 
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obsessive-compulsive disorder in adults [Lombroso et al., J Am Acad Child Adolesc 
Psychiatry, 33:921-938 (1994)]. A recent article [Moldin, Nature Genet. 17:127-129 
(1997)] has reviewed "The maddening hunt for madness genes." 



The present model addresses the question of the mode of inheritance of certain 
5 developmental disorders and proposes the "gene-teratogen model." The model 

suggests that the mode of inheritance of genes acting prenatally may in some cases be 
fundamentally different from that of genes acting postnatally. Even the same gene 
acting prenatally may produce a different disorder from that gene acting postnatally. 
The inheritance pattern in the gene-teratogen model is simple, but from the 
1 0 perspective of the patient with the developmental disorder is neither dominant nor 
recessive. Some disorders regarded as multifactorial, polygenic, or oligogenic may 
have this mode of inheritance. In the gene-teratogen model, genetically determined 
teratogen production by the mother during pregnancy damages the fetus producing 
Ul the abnormal phenotype of a developmental disorder. The model is illustrated with 

y 15 two types of loci, 1. a teratogenic locus acting in the mother, and 2. a modifying or 

^zz specificity locus acting in the fetus. Damage by the teratogen is influenced also by 

environmental factors. The model is interesting because it is simple and because 
teratogenic loci will be difficult to locate by parametric or non-parametric linkage 
mapping tecliniques due to misspecification of the affection status of both mother and 
20 affected children. A study design is suggested for identifying teratogenic loci. An 

example of the gene-teratogen model is the major intrauterine effect seen in offspring 
of phenylketonuric mothers. Certain developmental disorders whose mode of 
inheritance has been difficult to determine or whose genetic factors have been 
difficult to locate are candidates for the gene-teratogen model, including spina bifida 
25 cystica, Tourette's syndrome, learning disorders including dyslexia, conduct disorder, 
attention-deficit hyperactivity disorder, bipolar illness, schizophrenia, autism, and 
obsessive-compulsive disorder. 
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The Gene Teratogen Model 



The model is described in Table 1 using two kinds of loci: a "teratogenic" locus and a 
"modifying" or "specificity" locus. The gene-teratogen model requires a teratogenic 
locus. One or more modifying or specificity loci may or may not be present. Also, 
two types of phenotypes are defined: 1. the teratogen-induced phenotype; and 2. the 
teratogenic phenotype, i.e., the phenotype of a mother that produces a teratogenic 
effect during pregnancy. The two phenotypes are different for the teratogenic locus 
but are identical for the modifying or specificity loci. 



TABLE 1 

DIAGRAM OF THE GENE-TERATOGEN MODEL 



Grandparents: 


Maternal 
Grandmother 
AabbCCdd 


Maternal 
Grandfather 
AaBbCcdd 


Paternal 
Grandmother 
AAbbCcDd 


Paternal 
Grandfather 
AAbbCCdd 


Parents: 


Mother 
aaBbCcdd 


Father 
AAbbCcDd 


Child: 


Child (fetus) with developmental disorder 
AabbccDd 


locus A: 


teratogenic locus, recessive, acting in the mother to cause 
intrauterine teratogenic damage to the fetus. 


locus B: 


teratogenic locus, dominant, acting in the mother to cause 
intrauterine teratogenic damage to the fetus. 


locus C: 


modifying or specificity locus, recessive, acting in the fetus. 


locus D: 


modifying or specificity locus, dominant, acting in the fetus. 



The teratogenic locus may be dominant (locus A) or recessive (locus B). This locus 
acts in the mother during pregnancy to cause an intrauterine teratogenic effect in the 
fetus. The teratogenic effect may result from the production of an endogenous 
teratogen, from potentiation of an exogenous teratogen, from a metabolic deprivation 
or imbalance or from some other mechanism. Only one teratogenic locus is required; 
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both locus A and locus B are shown on the same diagram for simplicity. A specificity 
or modifying locus may be dominant (locus C) or recessive (locus D). Such a locus 
acts during pregnancy or after to modify the extent of the developmental damage 
done by the teratogenic locus or even to prevent or repair the damage. For example, 
5 for a teratogen acting at a certain time in development, locus C or D may determine 
whether brain or kidney is damaged, which structures of the brain are damaged, or 
whether damage occurs at all. 



1. Locus A, recessive teratogenic locus, acting in the mother: The child is the patient 
with the abnormal phenotype of a specific developmental disorder, while mother, 

10 father, and grandparents do not have the abnormal phenotype of that disorder (Table 
1). Locus A acts in the mother during pregnancy causing her to produce the 
teratogenic effect that damages the developing fetus leading to the developmental 
disorder either in the fetus or postnatally in the child or adult. Since this locus is 
recessive in action, the mother, a homozygote (aa) for the disease allele, is the genetic 

15 "patient." Her abnormal phenotype, the "teratogenic phenotype", is the trait of 
producing the teratogenic effect during pregnancy. Her fetus, damaged by the 
teratogenic effect in utero, does develop the teratogen-induced phenotype. However, 
the fetus is only a heterozygote (Aa) at locus A and thus lacks both the abnormal 
homozygous genotype at locus A and the abnormal teratogenic phenotype; e.g., if the 

20 fetus is a daughter, she will not produce the teratogenic effect later during pregnancy. 
Thus, the fetus is affected with the developmental disorder but is not the genetic 
"patient." Locus A, acting through a teratogenic effect, cannot be the only etiological 
factor for the developmental disorder. If it were, then all pregnancies of an aa mother 
would have the teratogen-induced phenotype which is not the case. Environmental 

25 and/or other genetic factors, are required. An aa father will have the abnormal 

genotype, but not the abnormal teratogenic phenotype because he could never become 
pregnant. 
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2. Locus B, dominant teratogenic locus acting in the mother: The situation is the 
same as for locus A except that locus B is dominant in action (Table 1). The mother 
has the abnormal genotype, Bb, and the abnormal teratogenic phenotype. The fetus 
has the teratogen-induced phenotype but in the instance shown (Table 1) has neither 

5 the abnormal genotype, the teratogenic phenotype, nor even a copy of the disease 
allele. The maternal grandfather shown (Table 1) has the abnormal genotype, Bb, but 
does not have the teratogenic phenotype because he could never become pregnant. 

3. Environmental effects: The teratogenic effect is modified by environmental 
factors, e.g. maternal dietary factors, infection, or ingestion of teratogen. These 
environmental factors may interact with locus A or B or may act independently. From 
the perspective of the fetus later to develop the developmental disorder 
(teratogen-induced phenotype), intrauterine teratogenic is an environmental not a 
genetic effect. 

4. Modifying or Specificity Loci Acting in the Fetus, Loci C & D: These loci may 
interact with the teratogenic locus or the environmental factors to increase or decrease 
their effect, or alternatively could act independently. Such genetic factors may be 
recessive (locus C) or dominant (locus D). Genotypes and phenotypes of locus C and 
D behave conventionally with respect to the developmental disorder. For locus C and 
D, the fetus is with the developmental disorder is now the genetic "patient". Maternal 
teratogenic in utero is an environmental effect. It is thus possible that the same gene 
locus could act in part as a teratogenic locus and in part as a modifying or specificity 
locus. 

DISCUSSION 

The Example of Phenylketonuria: An example of the gene-teratogen model is the 
25 major intrauterine effect in maternal phenylketonuria (PKU). Phenylketonuria itself 
is a recessive postnatal disorder. Untreated homozygous PKU mothers and fathers 
both have elevated blood phenylalanine (hyperphenylalaninemia). However, 



10 



nj 



15 



20 
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heterozygous offspring of untreated PKU mothers (but not fathers) have an abnormal 
phenotype.[Koch et al, Acta Paediatr Suppl, 407: 111-119 (1994); Allen et aL, Acta 
Paediatr Suppl 407:83-85 (1994); Abadie et al, Archives Pediatr, 3:489-486 
(1996)]. Thus the elevated blood phenylalanine or other metabolite(s) in the mother 
5 acts as a teratogen for the fetus. Note that the fetus of an untreated phenylketonuria 
mother does not have the phenotype of PKU (the "teratogenic phenotype"), but has a 
different phenotype (the "teratogen-induced phenotype 1 '). 
Phenylketonurias [Menkes, Textbook of Child Neurology, Lea & Febiger, 
Philadelphia (1990)] are normal at birth and develop a progressive disorder 
10 postnatally characterized by vomiting, eczema, seizures (infantile spasms with 
ijl hypsarrythmia on electroencephalography), and mental retardation. The fetus of an 

I J untreated phenylketonuria mother [Menkes, Textbook of Child Neurology, Lea & 

Febiger, Philadelphia (1990)] has a congenital non-progressive disorder of fetal 

Oj 

pi origin characterized by microcephaly, abnormal facies, mental retardation, congenital 



15 heart disease, and prenatal and postnatal growth retardation. The PKU phenotype is a 
S3 postnatal degenerative disorder; the phenotype of the PKU intrauterine effect is a 



developmental disorder. The teratogenic effect is not dependent upon the fetal 
genotype, although the fetus is an obligate heterozygote since the mother is a 
homozygote for phenylketonuria and the father (usually) has the normal genotype. 
20 Thus, in phenylketonuria, a mutation at the same gene locus causes two distinct 
disorders depending upon whether the period of abnormal gene 

action is prenatal or postnatal. A fetus with the abnormal homozygous genotype who 
is carried by a heterozygous mother is protected in utero, but develops PKU 
postnatally. A heterozygous fetus carried by a mother with the abnormal 
25 homozygous genotype is damaged in utero when the mother's genotype predominates, 
but is protected from PKU postnatally by its own genotype. 



An Example from Studies in Inbred Mice: Finnell and Chernoff [Gene-teratagen 
interactions: an approach to understanding the metabolic basis of birth defects, In 
Pharmacokinetics in Teratogenesis,Vol. 11:97-109 Experimental Aspects In Vivo and 
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In Vitro, CRC Press, Inc, Boca Ratan, Fl. (1987)] have reviewed a group of elegant 
experiments in inbred mice documenting that differences in susceptibility to 
exogenous teratogens can be regarded as a genetic trait that is determined by 
susceptibility or liability genes of either the maternal or fetal genotype [Finnell and 
5 Chernoff, Gene-teratagen interactions: an approach to understanding the metabolic 
basis of birth defects, In Pharmacokinetics in Teratogenesis,Vol. 11:97-109 
Experimental Aspects In Vivo and In Vitro, CRC Press, Inc, Boca Ratan, Fl. (1987)]; 
Finnell et aL, Am J. Med. Genet 70:303-31 1 (1997); Bennett et al.Epilepsia 38:415- 
423 (1997)]. For example, sensitivity to acetazolamine-induced ectrodactyly is 

10 determined by the presence of three genes, and the fetus must be homozygous for the 
recessive allele at all three loci in order to express the malformation. However, the 
inbred mouse models used do not mirror the human situation in at least three respects. 
First, the human population is an outbred population compared to these inbred mouse 
models. Consequently, the relevant genotypes may be highly variable among 

15 members of different families. Second, the inbred mouse experiments address the 
question of exogenous rather than endogenous teratogens. Third, the inbred mouse 
studies rely upon known or candidate susceptibility loci, whereas in humans, the 
problem has been to locate and identify disease unknown loci largely by using linkage 
mapping techniques. 

20 Implications for Linkage Mapping: 

Teratogenic Locus (Locus A or B): The gene-teratogen model has major implications 
for linkage mapping done with either parametric or non-parametric methods. The 
problem for both methods is incorrect assignment of affection status. In the lod score 
method, a genetic model of the disease is constructed and an affection status is 

25 assigned to each member of the pedigree. If the genetic model specified is wrong, the 
linkage results may be falsely positive or falsely negative [Terwilliger and Ott, 
Handbook of Human Genetic Linkage, Johns Hopkins Univ. Pr., Baltimore (1994)]. 
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In developmental disorders resulting from the gene-teratogen model, the phenotype 
assignment for lod score analysis will be incorrect. The patient with the 
developmental disorder will be assigned the affected phenotype, whereas the patient 
is actually affected only for the teratogen-induced phenotype, but is unaffected for the 
5 teratogenic phenotype. Likewise, the mother will be assigned the unaffected 
phenotype for linkage analysis. Actually, she is unaffected only for the 
teratogen-induced phenotype, but is affected for the teratogenic phenotype. Lod 
scores should increase when phenotype assignments have been corrected. However, 
apparently dominant inheritance may in fact turn out to be pseudodominant if the 
43 10 mutant allele is common in the population. For non-parametric analysis, a similar 

?i misassignment occurs. In the case of affected sib-pairs, the affected sibs will be 

J/f assigned the affected phenotype. Actually, the sibs are affected only for the 

01 teratogen-induced phenotype, but are unaffected for the teratogenic phenotype. The 

^ " mother will be assigned the unaffected or unknown phenotype. Actually, she is 

15 unaffected only for the teratogen-induced phenotype but is affected for the teratogenic 

yi 

fy phenotype. Thus, the "affected sib-pair" families are likely to turn out to contain only 

a single sporadic case, since the only individual in the kindred affected with the 
53 teratogenic phenotype will be the mother. 



For the transmission/disequilibrium test (TDT) [Spielman et aL, Am J Hum Genet, 
20 52:506-516 (1993); Ewens and Spielman, Am J Hum Genet, 57:455-464 (1995)] the 
patient with the developmental disorder will be assigned the affected phenotype. 
Actually, the patient will be affected only for the teratogen-induced phenotype but 
will be unaffected for the teratogenic phenotype. The mother will be assigned the 
unaffected or unknown phenotype. Actually, she is unaffected only for the 
25 teratogen-induced phenotype but is affected for the teratogenic phenotype. The 

expectation of TDT is that alleles of a linked locus will show distortion from random 
transmission from mother (or father) to the patient. Since the patient is unaffected for 
the teratogenic phenotype, no transmission distortion from mother (or father) to child 
will be observed. Transmission distortion for alleles of a teratogenic locus will in fact 
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occur from the mother's parents to the mother, the actual patient for the teratogenic 
phenotype. But this will not be looked for because the phenotypes have been wrongly 
assigned. In addition, grandparents of the patients with the developmental disorder 
have probably not had DNA collected. Therefore, for the TDT, negative results may 
5 occur for disease alleles of a teratogenic locus because incorrect phenotype 

assignments will have been made. When correct phenotype assignments have been 
made, transmission distortion to the mother from her parents should be expected for 
disease alleles of a teratogenic locus. Analogous misassignments are made in allelic 
association and haplotype relative-risk analyses [Falk and Rubinstein, Ann Hu, Genet, 

Cj 10 51:227-233 (1987); Terwilliger and Ott, Hum Hered, 42:337-346 (1992); Thomson, 

Ul Am J Hum Genet, 57:487-498 (1995)]. 

Jjf Modifying or Specificity Loci (Locus C and/or D) \ Since these loci behave in a 

yl 

01 conventional fashion, the phenotype assignments will be correct. Consequently, 

l*~ genes identified by conventional parametric or non-parametric linkage studies are 

Jfl 15 likely to be modifying or specificity loci. An important question for linkage mapping 

fjj is the relative contribution to the abnormal phenotype of the developmental disorder 

~5f made by the teratogenic locus versus that of a modifying or specificity locus. If the 

effect of a teratogenic locus is small, then loci identified by conventional linkage 

studies will be specificity or modifying loci and the mode of inheritance will be 
20 Mendelian or multifactorial. If a teratogenic locus makes a major contribution to 

phenotype, then linkage mapping studies will not give a consistent answer and the 

mode of inheritance will be difficult to determine. 



The presence of a teratogenic locus may be suspected if the maternal contribution to 
phenotype is different from or greater than the paternal contribution. For example, 
25 the mother's relatives of spina bifida infants more frequently have affected children 
than the father's relatives. Suggested explanations for this observation have been 
mitochondrial inheritance, maternal effect, or genomic imprinting [Chatkupt, Am J 
Med Genet, 44:508-512 (1992)]. The operation of a teratogenic locus is another 
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explanation and is itself a form of maternal effect. For a recessive teratogenic locus, 
the mother's sisters would be at greatest risk of having offspring with the 
teratogen-induced phenotype. 

Implications for Definition of Phenotype: All the pregnancies of a mother with the 
teratogenic phenotype are at risk for the developmental disorder, the 
teratogen-induced phenotype. Yet only a few of the fetuses will be affected by the 
developmental disorder because of the action of environmental factors and/or the 
modifying or specificity loci. The action of the environmental factors is fully 
quantitative: depending upon the amplitude of the environmental effect, a mild, 
moderate, or severe teratogen-induced phenotype may result. In addition, the 
environmental factor may act at different times in fetal development producing 
qualitatively different pheno types. Thus, quantitatively or qualitatively different 
teratogen-induced pheno types may result from pregnancies of the same mother with 
the teratogenic phenotype. In addition, the action of the modifying or specificity loci 
may produce quantitatively or qualitatively different phenotypes in offspring of the 
same couple. Such different phenotypes may be diagnostically classified as different 
disorders. This may complicate attempts at associating specific loci with a 
specific teratogen-induced phenotype. All of the teratogen-induced phenotypes 
resulting from pregnancies of a mother with the teratogenic phenotype modified only 
by environmental factors are genetically indistinguishable. However, such 
teratogen-induced phenotypes affected also by the various modifying or specificity 
loci segregating among the offspring of a single couple are only partially genetically 
related. 

Methods to Identify Teratogenic Loci: One effective approach to finding a putative 
25 teratogenic locus is to carry out non-parametric linkage studies of families consisting 
of a patient affected with the developmental disorder, the patient's two (unaffected) 
parents, and the patient's four (unaffected) grandparents (Table 1). In such a family, 
the mother is the genetic patient but the other family members are not. Now, the 
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mother's nuclear family (the mother and her parents) is compared with the father's 
nuclear family (the father and his parents). In a haplotype relative risk study, the 
disease allele(s) of the teratogenic locus will occur more frequently in the mother 
compared with other alleles of her parents; the disease allele(s) of the teratogenic 
5 locus will not occur more frequently in the father compared with other alleles of his 
parents. In a transmission/disequilibrium test, transmission distortion will be seen for 
the disease allele(s) of a teratogenic locus in the mother's nuclear family but not in the 
father's nuclear family. In an allelic association study, the disease allele will occur 
more frequently in mothers, patients (with the developmental disorder), and patient's 
^ 10 sibs (both affected and unaffected) than in unrelated control individuals. Disease 

^3 allele frequency in fathers will not be distinguishable from that in control individuals. 

Ul 
* s ? j. 

~* Certain developmental disorders with a genetic component to etiology, whose mode 

P 1 of inheritance has been difficult to determine or whose genetic factors have been 

a difficult to locate, including those mentioned earlier, are candidates for the 
J^f 15 gene-teratogen model. 

nj 

p MODEL 2 : 

^ J The DNA Polymorphism-Diet-Cofactor-Development Hypothesis 

for Schizophrenia and Other Developmental Disorders 



Folate metabolism is complex. At least 30 gene loci are involved in absorption, 
20 transport, and metabolism of folate, and these are regulated by additional gene loci. 
Any of these is potentially a genetic risk factor for schizophrenia, although MTHFR 
and DHFR are particularly good candidates. Likewise, genes encoding proteins 
involved in the pathways of other vitamin-cofactors may be genetic risk factors. 



25 



Two cofactors that may be of particular potential importance are cobalamin and 
pyridoxine. Cobalamin is relevant because its metabolism is closely intertwined with 
that of folate. For example, cobalamin is required for the activity of methionine 
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synthase (MTR), a folate-related enzyme. Decreased cobalamin can affect folate 
metabolism through the folate trap. Pyridoxine is relevant because the 
pyridoxine-dependent enzyme cystathionine beta-synthase (CBS), along with the 
cobalamin-dependent enzyme MTR and folate pathways including MTHFR and 
5 DHFR all participate in catabolism of homocysteine, an amino acid that is suspected 
of being a teratogen during pregnancy. Also, kynureninase, an important enzyme 
affecting niacin metabolism and serotonin synthesis is pyridoxine-dependent. 
Therefore, mutations of the genes encoding such proteins, especially common 
polymorphisms, could play a role in the cause of schizophrenia. 

Since folate, cobalamin, and pyridoxine are all dietary constituents, the dietary 
content of these cofactors could be lead to an "environmental" generation of a risk 
factor for schizophrenia. In addition genes encoding proteins involved in folate, 
cobalamin, and pyridoxine metabolism and catabolism could be genetic risk factors 
for schizophrenia. Thus, the cofactors and the proteins involved in pathways relevant 
to these cofactors can potentially have either or both environmental and genetic 
effects on the susceptibility of an individual on schizophrenia. 

Since the genetic aspect of schizophrenia differs so profoundly from other disorders 
which have been identified by linkage mapping techniques, it is clear that a new 
model for the genetic connection to schizophrenia is required. Therefore, the DNA 
Polymorphism-Diet-Cofactor-Development (DDCD) hypothesis, is disclosed herein. 

The DDCD hypothesis is that interacting genetic and environmental factors affecting 
the metabolism of folate, cobalamin, or pyridoxine or all of these, play a role in the 
etiology of schizophrenia. The genetic effect results from the aggregate effect of 
multiple mutations that individually, for the most part, have small effects on folate-, 
25 cobalamin- or pyridoxine-related genes, some of which will be common in the 

population, and can act in utero. Environmental factors include dietary folate and 
cobalamin and pyridoxine. If schizophrenia results from mild deficiency during fetal 
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development of dietary folate, cobalamin, or pyridoxine potentiated by mild genetic 
susceptibility mutations of genes related to these cofactors and by pregnancy, then 
this would be difficult to document by linkage mapping techniques. An example of 
interaction of genetic and environmental factors is that genetic factors are important 
5 for incorporating dietary folate; the enzyme dihydrofolate reductase is required for 
conversion of dietary folate to folinic acid thus allowing dietary folate to enter the 
body's metabolic pathways. Another example is that folate and cobalamin 
requirements increase during pregnancy; thus pregnancy could potentiate the effects 
of mild genetic defects of mother, fetus, or both. Deficiencies of a vitamin are often 
10 part of a broader dietary deficiency affecting multiple nutrients in addition to the 
vitamin being measured. 

Locus Heterogeneity: The metabolic pathways of folate, cobalamin, and pyridoxine 
are complex and related to each other. Multiple gene loci code for the enzymes and 
transport proteins are required (Tables 2-7). Thus, a defect of folate, cobalamin, or 
pyridoxine metabolism could result from the aggregate effect of multiple mutations 
each of relatively small effect interacting with environmental factors. Different 
individuals might have different combinations of mutations. Such a metabolic defect 
would be difficult to detect by linkage mapping techniques because of locus 
heterogeneity. 

20 Alternatively, even if one genetic defect were sufficient to make an individual more 
susceptible to having schizophrenic offspring, for example, because of the large 
number of potential genetic factors, and the corresponding importance of 
environmental factors, elucidation of such an individual genetic defect would still be 
difficult unless, of course, the genetic defect caused a major effect. The difficulty in 

25 elucidating an individual genetic defect is magnified when the genetic factor acts in 
the mother, and not in the schizophrenic patient. 
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High Disease Allele Frequency: Numerous mutational variants of folate and 
cobalamin genes are known. Some of these have functional significance and in 
addition are sufficiently common in a given population to be regarded as genetic 
polymorphisms. However, these common alleles are unlikely to have a major 
5 harmful effect by themselves, for if they did they would become uncommon in the 
population in the absence of selection effects, and would likely appear as Mendelian 
disorders. Thus, the folate, cobalamin, or pyridoxine disease alleles related to 
schizophrenia would appear to be more likely those of minor deleterious effect or 
those with harmful effect only in the presence of environmental deficiencies or 
10 pregnancy. Such disease genes of high population frequency will be difficult to 

detect by linkage mapping methods because high disease allele frequency decreases 
the power of linkage studies [Terwilliger and Ott, Handbook of Human Genetic 
Linkage, John Hopkins Univ. Press, Baltimore, (1994)]. 



Developmental Genes: Folate, cobalamin, and pyridoxine defects act prenatally as 
hi 1 5 well as postnatally . Folate, cobalamin, and pyridoxine metabolism are crucial for 

W~ DNA synthesis and cell division, which are of disproportionate importance during 

brain development. Some defects of folate, cobalamin, or pyridoxine metabolism 
elevate blood homocysteine, a toxic and potentially teratogenic substance. Genes 
acting in the mother to damage the developing fetus, e.g. via the gene-teratogen 
20 model (Model 1, above), have a mode of inheritance that is neither dominant nor 
recessive with respect to the fetus. Attempts to assign a mode of inheritance in this 
situation will be unsatisfactory because affection status would be incorrectly assigned. 
The mode of inheritance of a developmental disorder resulting from a teratogenic 
locus would be regarded as either multifactorial or unknown. This is the situation 
25 with schizophrenia whose mode of inheritance is unknown. Use of an incorrect 
genetic model decreases the power of a linkage studies [Terwilliger and Ott, 
Handbook of Human Genetic Linkage, John Hopkins Univ. Press, Baltimore, 
(1994)]. 



• 
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Genes of Folate Metabolism: Folate metabolism is extremely complex [Rosenblatt, 
In: The Metabolic and Molecular Bases of Inherited Disease, Scriver et al. (eds), New 
York: McGraw-Hill, pp. 31 1 1-3128 (1995); Mudd et al, In: The Metabolic and 
Molecular Bases of Inherited Disease, Scriver et al (eds), New York: McGraw-Hill 
5 pp. 1279-1327 (1995)]. At least 30 gene loci (Table 2) have been identified as 
folate-related. These contribute to folate mediated 1 -carbon transfer reactions, 
binding, transport and metabolism of folate, and other functions. A number of these 
have been cloned and localized to a chromosomal region (Table 3). 



pi 




55 



TABLE 2 

FOLATE-RELATED GENES/ENZYMES/TRANSPORTERS 1 





Folate-Related Genes/Enzymes/Tranporters 3 


SEQ ID NO: 




methylenetetrahydrofolate reductase, MTHFR, MIM 236250 


1 


5 


methionine synthase (methyltetrahydrofolate:L-homocysteine 
S-methyltransferase), MTR, MIM 156570 


2 




dihydrofolate reductase, DHFR, MIM 126060 


3 




folylpolyglutamate synthase, FPGS, MIM 136510 


4 


S 10 


folate receptor 1, folate receptor alpha (FOLR1, adult; FR-alpha), 
MIM 136430 


5 




folate receptor 2, folate receptor beta (FOLR2, fetal; FR-beta), 
MIM 136425 (a.a.) 


6 


>J 
pj z 


folate receptor 2-like (FOLR2L, fetal-like), MIM-none 






folate receptor gamma (FR-gamma), MIM 602469 


7 


15 


serine hydroxymethyltransferase 1, SHMT1, MIM 182144 


8 


i g* 

yj 


methylenetetrahydrofolate dehydrogenase, methenyltetrahydrofolate 
cyclohydrolase, 10-formyltetrahydro folate synthetase (trifunctional enzyme, 
MTHFD), MIM 172460 


Q 




serine hydroxymethyltransferase 2, SHMT2, MIM 138450 


10 


^ 20 


thymidylate synthase, TYMS, MIM 188350 


11 




GAR (5-phosphoribosylglycineamide) transformylase, GART, MIM 138440 


12 




reduced folate carrier- 1, RFC1. Probably identical to micromolar membrane 
transport protein, intestinal folate carrier- 1 (IFC1), and neutral folate transport 
protein. MIM 600424 


13 


25 


cystathionine beta-synthase, CBS, MIM 236200 


14 




AICAR (5-phosphoribosyl-5-arninoimidazole-4-carboxamide) transformylase 


15 




glutamate formiminotransferase, MIM 229100 






forminotetrahydrofolate cyclodearninase 






5, 10-methenyltetrahydrofolate synthetase 


16 


30 


1 0-formyltetrahydrofolate dehydrogenase, Mim 600249 
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Folate-Related Genes/Enzymes/Tranporters a 


SEQ ID NO: 


glycine cleavage pathway (SHMT plus three enzymes): 
MIM 238331 

H-Protein MIM 238330 
T-Protein MIM 238310 


17 
18 
19 


cblG (affects function of MTR), MIM 250940 




methionme adenosyltransferase l, MATIA, (ATP:L-methionuie S- 
adenosyltransferase), MIM 250850 


20 


pteroyl polyglutamate hydrolase ("conjugase"), form l 




pteroyl polyglutamate hydrolase ( conjugase ), form 2 




NAD-dependent enzyme methylene tetrahydrofolate dehydrogenase 
cyclohydrolase (a.a.) 


21 


methionine adenosyltransferase 2, MAT2A, MIM 601468 


22 


5-methyltetrahydrofolate- homocysteine methyltransferase reductase (MTRR) 
MIM 602568; #Variant in MTRR linked to cblE MIM 236270 


23 


methyltranferases 




S-adenosylmethionine decarboxylase, MIM 180980 


24 


decarboxylated S-adenosylmethionine :putrescine propylaminotransferase or 
spermidine synthetase (a.a.) 


25 


S-adenosylhomocysteine hydrolase, , MIM 180960 


26 


betaine-homocysteine methyltransferase dimethylthetin-homocysteine 
methyltransferase 


27 


gamma-cystathionase (L-cystationine cysteine-lyase (deaminating)), 
MIM 602888 


28 


folic acid transport protein, MIM 229050 




DHFR (exon 6 and 3 'flanking region) 


30 


kynureninase 


35 


human DHFR, exons 1 and 2 [Chen et al.J. Biol. Chem. 259:3933-3943 
(1984)] 


36 


a listed with alternate names, abbreviations, and MIM numbers; 

#cblE is a phenotype for a particular group of disorders of folate/cobalamin metabolism, 
(a.a.) indicates the amino acid sequence 
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TABLE 3 

LOCALIZED GENE LOCI RELATED TO FOLATE METABOLISM 



Gene/enzyme/transport protein 


Location 


References 


MTHFR 


lp36.3 


Goyette et a/.,(l994); 
* ** 


MTR 


lq43 


Cook and Hamerton, (1979); 

Mellman et al, (1979) 
** 


DHFR 


5qll.2-13.2 


Weiffenbach et al., (1991) 

Gilliam al (1989b) 
* ** 


FPGS 


9cen-q34 


Jones and Kao(l984): 

Walters al (1992) 
* ** 


MAT 


10q22 


** 


FR 


Ilql3.3-ql4.1 

11 1^1 i 1 O ^ 

Ilql3.3-ll3.5 


Lacey et al (1989), 
Ragoussis et al, (1992); 
Ratnume/a/ H989V 
Walter et al. (1992); * 
Ragoussis et al, (1992), ** 


SHMT2 


I2ql2-ql4 
I2ql3 


Garrow et al, (1993); 
Law and Kao, (1979) * 


MTHFD 


I4q24 


Rozen et al., (1989), 
Jones etal. (1981), * 3 ** 


LCCL 


I6pter-qter 




SHMT1 


I7pll.2 


Garrow et al, (1993) *, ** 


TYMS 


I8pll.3l.-pll.22 
I8pll.32 


Hori et al, (1990); 
Silverman et al, (1993) 


SAHH 


20cen-ql3.l 


* 


GART 


2lq22.l 


Mclnnis et al (1993) 
Schild et al (1990) 
Avrarmopoulos et al (1993) 

Goto etal (1993) 

* ** 

> 


RFC1 


2lq22.2-22.3 


Moscow et al, (1995) 
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Gene/enzyme/transport protein | Location 



References 



CBS 21q22.3 



Munke etal, (1988) 



5 



10 

U] 

>j 15 

==3 : 

Ul 20 

0J 

Ui- 
pi 

25 



30 



35 



40 



notes: MTHFR=methylenetetrahydrofolate reductase. MTS-methionine synthase. 
DHFR=dihydrofolate reductase. FPGS=folylpolyglutamate synthase. 
MAT=methionine adenosyltransferase, (ATP:L-methionine S-adenosyltransferase). 
FR=folate receptor complex: FR-alpha=FOLRl=folate receptor 1, adult; FR- 
beta=FOLR2=folate receptor 2, fetal; FR-gamma; FOLR2L=folate receptor 2-like. 
SHMT2=serine hydroxymethyltransferase 2, mitochondrial. MTHFD=5, 10- 
methylenetetrahydrofolate dehydrogenase, 5, 10-methylenetetrahydrofolate 
cyclohydrolase, 10-formytetrahydrofolate synthase (trifunctional enzyme). 
LCCL=gamma-cystathionase (L-cystathionine cysteine-lyase (deaminating). 
SHMTl=serine hydroxymethyltransferase 1, soluble. TYMS=thymidylate 
synthetase. SAHH, S-adenosylhomocysteine hydrolase. 

GART=phosphoribosylglycineamide formyl transferase. RFCl=reduced folate 
carrier- 1 (possibly identical to IFC1, intestinal folate carrier- 1). CBS=cystathionine 
beta-synthase. Location information from GOD (*), from MIM (**). 

Goyette et al, Nat Gen. 7:195-200 (1994) 

Cook and Hamerton, Cytogenet Cell Genet. 25:9-20 (1979) 

Mellman et al, Proc. Natl Acad. Set 76:405-409 (1979) 

Weiffenbach et al, Genomics 10:173-185 (1991) 

Gilliam et al. Genomics 5:940-944 (1989b) 

Jones and Kao Cytogenet Cell Genet 37: 499 (1984) 

Walter et al Ann. Hum. Genet 56:212 (1992) 

Lacey et al. Am J. Med. Genet 60:172-173 (1989) 

Ragoussis et al, Genomics 14:423-430 (1992) 

Ratnum et al Biochem. 28:8249-8254 (1989) 

Garrow etal. J. Biol. Chem. 268:11910-11916(1993). 

Law and Kao 5 Cytogenet Cell Genet, 24: 102-114 (1979) 

Rozen et al.,Ann. Hum. Genet, 44:781-786 (1989) 

Jones et al Somat Cell Genet. 7:399-409 (1981) 

Hon etal, Hum. Genet 85:576-580 (1990) 

Silverman et al, Genomics 15:442-445 (1993) 

Mclnnis et al Genomics 16:562-571 (1993) 

Schild et al. Proc. Natl Acad. Sci 87:2916-2920 (1990) 

Avrarmopoulos et al Genomics 15:98-102 (1993) 

Goto et al Neuromusc Disord. 3:157-160 (1993) 

Moscow et al, Cancer Res. 55:3790-3794 (1995) 

Munke et al Am J. Hum. Gen. 42:550-559 (1988) 



Genes of Cobalamin Metabolism: Cobalamin metabolism is also complex [Benton 
and Rosenberg, In: The Metabolic and Molecular Bases of Inherited Disease, 
Disease, Scriver et al (eds), New York: McGraw-Hill, 3129-3149 (1995)]. At least 
15 gene loci (Table 4) have been identified as cobalamin-related. These contribute to 
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the binding, transport, and metabolism of cobalamin, and its functions. A number of 
these have been cloned and localized to a chromosomal region (5). Cobalamin 
metabolism is closely intertwined with that of folate. For example, cobalamin is 
required for the activity of MTR, a folate-related enzyme. Decreased cobalamin can 
affect folate metabolism through the folate trap [Rosenblatt, In: The Metabolic and 
Molecular Bases of Inherited Disease, Scriver et al (eds), New York: McGraw-Hill, 
pp. 3111-3128 (1995); Quadras et al, Biochem. Biophys. Res. Commun., 222:149- 
154(1996)]. 
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TABLE 4 



COBALAMIN-RELATED GENES/ENZYMES/TRANSPORTERS' 





(~*r\\\c\ 1 q m i n _T? p»l q t^H f^^rif*c/T^Ti7VT*nf*c/ f T'r5i'nr\r*T*t( : »rc a 


SEO ID NO* 


5 


i gdainu^ liiLriiibit/ idciur, vJ.Lr, iviiivi-^tii kjkjkj ^wuiiiumcu u.ciiwiciiwjf ui 
GIF & R-binder, MIM 243320 


31 




intrinsic factor receptor, IFCR, MIM-261 100 




'43 


transcobalamin I, TCI (an R-protein, plasma), MIM 189905 


32 


S 


transcobalamin III, TCIII (an R-protein, plasma), MIM-none 






oincr i\.-proLCino ^iv-uiiilici &, wuu«.i*jpiiyiiiio, iiapiu^uiiiiio^, iviiivi lz^j\jz^\j 




?} 10 


transcobalamin II, TCII MIM 275350 


33 




transcobalamin II receptor, TCII receptor, MIM-none 




ih 


methylmalonyl Co- A mutase, MCM (MUT locus), MIM 25 1000 


34 


yj 


cblF, lysosomal cbl efflux, MIM 277380 






cblC, cytosolic cbl metabolism, MIM 277400 




15 


cblD, cytosolic cbl metabolism, MIM 277410 






cblA, mitochondrial cbl reduction, (AdoCbl synthesis only), MIM 
251100 






cblB, cob(I)alamin adenosyl transferase, (AdoCbl synthesis only), MIM 
251110 




20 


cblE, methyltransferase-associated cbl utilization, MIM 236270 






cblG, methyltransferase-associated cbl utilization, MIM 250940 






a listed with alternate names, abbreviations, and MIM numbers 
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TABLE 5 



LOCALIZED GENE LOCI RELATED TO COBALAMIN METABOLISM 



Gene/enzyme/transport protein 


Location 


References 


MCM (MUT locus) 


6p21.2-p21.1 


Qureshi etal (1994) * 


IF/GIF 


Ilql2-ql3 


Hewitt al (1991) * 


TCI (an R-protein, plasma) 


llqll-ql2.3 


Johnston et al, (1992) 
Sigal et al, (1987), * 


TCII 


22qll.2-ql3 
22ql2/13 border 


Uetal, (1995) 


notes: MCM=methymalonyl Co-Amutase; IF/GIF=(gastric) intrinsic factor; 
TCI=transcobalmin I; TCII=transcobalamin II. Location information from GDB (*), from 
MIM (**). 


Qureshi etal, Crit. Rev, Oncol Hematol 17:133-151 (1994) 

Hewitt al, Genomics 10:432-440 (1991) 

Johnston et a/.,Genomics 12:459-464 (1992) 

Sigal et al,N. Engl J. Med. 317:1330-1332 (1987) 

Li et al, Biochem. Biophys. Res. Comm. 208:756-764 (1995) 



Genes ofPyridoxine Metabolism: Pyridoxine metabolism is also complex with three 
dietary forms convertible to pyridoxal phosphate [Whyte et al, Hypophosphatasia, 
In: The Metabolic and Molecular Bases of Inherited Disease, Scriver et al. (eds), New 
York: McGraw-Hill pp. 4095-41 1 1 (1995)] and many pyridoxine-related and 

20 pyridoxine-dependent enzymes including decarboxylases and all aminotranferases 
(Table 6). A number of pyridoxine-related enzymes have been cloned and localized 
to a chromosomal region (Table 7). Pyridoxine metabolism is related to folate 
metabolism, especially 1 -carbon transfer reactions: both serine 
hydroxymethyltransferases and the P-protein (glycine decarboxylase) of the glycine 

25 breakdown system are pyridoxine-dependent. 
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TABLE 6 

SOME PYRIDOXINE-RELATED GENES/ENZYMES/ a 



1 . cystathionine beta-synthase, CBS, 



MIM 236200 



2. gamma-cystathionase 5 MIM 219500 
5 (L-cystathionine cysteine-lyase, deaminating), LCCL 

3. glycine cleavage system (GCS): glycine decarboxylase (P -protein) 



4. serine hydroxymethyltransferase 1, SHMT1, 



MIM 182144 



5. serine hydroxymethyltransferase 2, SHMT2, 



10 6 . kynureninase 



MIM 138450 



MIM 278600 



7. all aminotransferases, MIM 258870 

(e.g. ornithine-gamma-aminotranferases, OAT, ) 

8 . decarboxylases, MIM 266 1 00 
15 e.g. glutamic acid decarboxylases, GAD1, GAD2, 



9. pyridoxamine(pyridoxine)-5 '-phosphate oxidase 



MIM 603287 



listed with alternate names, abbreviations, and MIM numbers. 
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TABLE 7 

SOME LOCALIZED GENE LOCI RELATED TO PYRIDOXINE METABOLISM 



Gene/enzyme 

1 . GAD 2 

5 2 . GCS P-protein 

3 . GAD1 

4 . OAT 

5 . SHMT2 

10 6 . LCCL 

7 . SHMT1 

8 . CBS 

9. PNPO (PPO) 



Location 
2q31, 
9pl3 

lOpll .23 

10q26 

12ql2-14 

16pter-qter 
17pll . 2 
21q22 . 3 



References 
Bu et al . , 1992) 
Hamosh et al. 1995) 

Bu et al. 1992) 

* * 

Garrow et al . , 1993; 

Law and Kao, 197 9 

* * * 

Garrow et al. 1993 * ** 
Munke et al. 1988 
Ngo et al. 1998 



a listed with alternate names, abbreviations, and MIM numbers. 
15 Location information from GDB (*), from MIM (**). 

notes: GAD2=glutamic acid decarboxylase 2, 67 kDa. GCS=glycine cleaving system, 
P-protein=glycine decarboxylase subunit. GADl=glutamic acid decarboxylase 1, 
65 kDa. OAT=ornithine-gamma-aminotranferases. SHMT2=serine 
hydroxymethyltransferase 2, mitochondrial. LCCL=gamma-cystathionase 
20 (L-cystathionine cysteine-lyase (deaminating). SHMTl=serine 

hydroxymethyltransferase 1, soluble. CBS=cystathionine beta-synthase. PNPO= 
pyridoxamine(pyridoxine)-5 '-phosphate oxidase 
References : 

Bu et a/., Proc. Nat. Acad. ScL, 89:2115 (1992). 
25 Hamosh et al., In: "The Metabolic and Molecular Bases of Inherited Disease", 
Scriver et al (eds), New York: McGraw-Hill pp.1337-1348 (1995). 

Garrow et al. J. Biol. Chem. 268:11910-11916 (1993). 

Law and Kao, Cytogenet Cell Genet, 24: 102-1 14 (1979). 

Munke et al. Am J. Hum. Gen. 42:550-559 (1988). 
30 Ngo et al. Biochemistry 37/7741-7748 (1998). 
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Relevance of Folate, Cobalamine, And Pyridoxine to Schizophrenia: There is 
considerable evidence that schizophrenia results, at least in part, from damage to brain 
development in utero that becomes symptomatic in late adolescence or early 
adulthood. The etiology of schizophrenia has both genetic and environmental 
5 components. Because folate, cobalamin, and pyridoxine are all ingested and 

metabolized, they could potentially be both environmental and genetic factors for 
schizophrenia. Folate, cobalamin, and pyridoxine are relevant to schizophrenia in 
important ways. First, all of them are required for cell division because of their role in 
nucleic acid synthesis [Rosenblatt, In: The Metabolic and Molecular Bases of 
y I 10 Inherited Disease, Scriver et al. (eds) New York: McGraw-Hill, pp. 3111-3128 

Si (1995); Benton and Rosenberg, In: The Metabolic and Molecular Bases of Inherited 

L: Disease, Scriver et al. (eds)., New York: McGraw-Hill, 3129-3149 (1995)]. The 

V 1 developmental brain insult implicated in schizophrenia [Akbarian et al, Arch. Gen. 

Q Psychiatry, 50:169-177 (1993); Akbarian et al, Arch. Gen. Psychiatry, 50:178-187 

LP 

~h 15 (1993)] is an abnormality of neurogenesis and neuronal migration, which are 

W midtrimester events requiring cell division. Thus folate, cobalamin, and pyridoxine 

p deficiencies could result in the widespread decreased grey matter volume observed in 

schizophrenia. 



Individuals that become schizophrenic later in life are more likely to be bom during 
20 the winter and early spring [Boyd et al, Schizophr. Bull, 12:173-186 (1986); Kendell 
and Adams, Br. J. Psychiatry, 158:758-763 (1991); O'Callaghan et al, Br. J. 
Psychiatry, 158:764-769 (1991)]; this corresponds to midtrimester in late fall & 
winter. Many folate- and pyridoxine-containing foods, e.g. dark green leafy 
vegetables, are less readily available in late fall & winter in northern climates. 
25 Seasonality was found to be a major determinant of micronutrient status including 
folate status in a population of pregnant and lactating women in The Gambia where 
folate deficiency was widespread [Bates et al Eur. J. Clin. Nutr. 48:660-668 (1994)]. 
Dietary cobalamin comes from animal foods, e.g. meat, dairy products, and fish, and 
prolonged dietary insufficiency is required to produce cobalamin deficiency unless a 
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person is a strict vegetarian or already has subclinical deficiency [Sanders and Reddy, 
Am. J. Clin. Nutr., 59:1 176S-1 181S (1994)]. In fact, a significant fraction of the 
population already has subclinical deficiency for folate [Lewis et al, Ann. NY Acad. 
ScL, 678:360-362 (1993)] and for [Carmel etal, Arch. Intern. Med., 147:1995-1996 
5 (1987); Pennypacker et al, J. Am. Geriatr. Soc, 40:1 197-1204 (1992); Naurath et al, 
Lancet, 346:85-89(1995); Allen et al, Am. J. Clin. Nutr., 62:1013-1019(1995); 
Black et al, J. Nutr., 124:1 179-1 188 (1994)]. Also, the dietary folate requirement 
increases during pregnancy [Scholl et al, Am. J. clin. Nutr., 63:520-525 (1996); 
Jj: McPartlin et al, Lancet, 341:148-149 (1993)] and most women become folate 

Ml 10 deficient during late pregnancy [Giles, J. Clin. Pathol, 19:1-11 (1966)]. Cobalamin 

%l deficiency is also common during pregnancy [Gadowsky et al, J. Adolesc. Health, 

16:465-474 (1995)] although subnormal levels of vitamin B12 during pregnancy must 
be interpreted with caution [Metz et al, Am. J. Hemetol, 48:251-255 (1995)]. An 
increase in schizophrenia births has also been noticed after winter famine [Susser and 
15 Lin, Arch. Gen. Psychiatry, 49:983-988 (1992)]; Susser et al, Arch. Gen. Psychiatry, 
53:25-31 (1996)], a time when severe dietary deficiency of both folate and cobalamin 
is more likely. A temporary increase in the incidence of neural tube defects was 
reported in Jamaica 11-18 months following Hurricane Gilbert and was found to be 
associated with decreased dietary folate [Duff and Cooper, Am J. Pub.Health 84:473- 
20 476(1994)]. 



Schizophrenia is also associated with obstetrical complications, e.g. low birth weight 
and prematurity [Lewis and Murray, J. Psychiatr. Res., 21:413-421 (1987)]. Low 
birthweight and prematurity have also been associated with dietary folate deficiency 
during pregnancy Scholl et al, Am. J. clin. Nutr., 63:520-525 (1996). 
25 Hyperhomocysteinemia is a risk factor for unexplained recurrent early pregnancy loss 
[Wouters et al, Fertil Steril, 60:820-825 (1993)] and for abruptio placentae 
[Goddijn-Wesel et al, Eur. J. Obstet Gynecol Reprod. Biol, 66:23-29 (1996)]. 
Hyperhomocysteinemia may be related to defects in folate-, cobalamin-, or 
pyridoxine-dependent reactions [Naurath et al, Lancet, 346:85-89 (1995)]. 
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Interestingly, stillbirths and schizophrenia share a similar seasonality of birth excess 
[Torrey et al, Schizophr. Bull, 19:557-562 (1993)]. Also N 2 0, an anaesthetic gas 
that inhibits MTR, a cobalamin-requiring enzyme of folate metabolism, is a 
reproductive toxin for both men and women [Louis-Ferdinand, Adverse Drug React 
5 Toxicol Rev., 13: 193-206 (1994)]. Methotrexate, an inhibitor of dihydrofolate 
reductase (DHFR), induces abortion. 

Dietary folate deficiency and low plasma folate are common in inner city urban 
populations [Scholl et ah, Am. J. clin. Nutr., 63:520-525 (1996)]. Likewise, 
schizophrenia has been reported to be more common in inner city urban populations 
10 [Fuller and Bowler, Schizophr. Bull., 16:591-604 (1990)]. Also, both low folate 

intake [Schorah and Wild, Lancet, 341:1417 (1993)] and schizophrenia [Dohrenwned 
et al, Science, 255:946-952 (1992)] are correlated with lower socioeconomic status. 

Immune function is impaired in folate deficiency [LeLeiko and Chao, In: Rudolph 's 
Pediatrics, 20th ed., Stamford, CT: Appleton & Lange, pp. 1001-1010 (1996)], in 

15 cobalamin deficiency [Hitzig et al, Ciba. Found. Symp., 68:77-91 (1978)] and in 
pyridoxine deficiency [Trakatellis et al Postgrad Med. J. 73:617-622 (1997)] and 
deficient individuals are more susceptible to infection. Methotrexate, an inhibitor of 
dihydrofolate reductase, inhibits immune function [Hughes, In: Rudolph 's Pediatrics, 
20th ed., Stamford, CT: Appletone and Lange, pp. 517-519 (1997)]. And, as 

20 mentioned, dietary folate and cobalamin requirements increase during pregnancy 
[Scholl et al, Am. J. clin. Nutr., 63:520-525 (1996); McPartlin et al, Lancet, 
341:148-149 (1993)]. This is relevant because the season-of-birth effect just 
mentioned in connection with dietary folate, or cobalamin deficiency has also been 
explained by in utero infectious illness, the "viral theory" of schizophrenia. 

25 Individuals born following winters with severe influenza epidemics are more likely to 
develop schizophrenia [Adams et al, Br. J. Psychiatry, 163:522-534 (1993)] though 
not all studies find this effect. Although it has not been demonstrated that either the 
schizophrenia fetus or the pregnant mother actually developed influenza, the 
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histologic pattern in schizophrenia of a neuronal migration abnormality during brain 
development has been seen as compatible with a fetal viral infection [Kovelman and 
Scheibel, Biol. Psychiatry, 19:1601-1621 (1984); Bogerts et al, Arch. Gen. 
Psychiatry, 42:784-791 (1985); Akbarian et al, Arch. Gen. Psychiatry, 50:169-177 
5 (1993); Akbarian et al, Arch. Gen. Psychiatry, 50:178-187 (1993)]. Thus folate or 
cobalamin, deficiency during pregnancy could result in greater susceptibility to viral 
infection affecting mother, fetus, or both. The infectious agent could be influenza 
itself. Alternatively, a severe influenza epidemic could be a "marker" of a severe 
winter, and infection by another agent could cause the brain damage. In this way, 
10 folate or cobalamin deficiency could cause the season-of-birth effect either through 
the mechanism of dietary deficiency alone, through maternal immune deficiency and 
infection, or both. 



Methotrexate, a DHFR inhibitor, is also an important therapeutic agent for 
rheumatoid arthritis. Rheumatoid arthritis has repeatedly been found to have a 
f!J 15 decreased frequency in schizophrenics, a puzzling finding that remains unexplained 

[Eaton et al, Schizophr. Res. f 6:181-192 (1992)]. 

The developmental model of schizophrenia postulates that brain damage sustained in 
the second trimester of fetal life results in schizophrenia later in development [Brixey 
et al, J. Clin. Psychol, 49:447-456 (1993)]. Both folate and cobalamin are already 

20 known to contribute to a first trimester fetal nervous system malformation, spina 
bifida cystica [Kirke et al, Q. J. Med., 86:703-708 (1993); Gordon, Brain Dev., 
17:307-311 (1995)], and possibly other birth defects [Shaw et al, Lancet, 346:393- 
396 (1995); Czeizel, Lancet., 345:932 (1995)]. Some studies [Whitehead etal, Q. J. 
Med., 88:763-766 (1995); van der Put et al, Lancet, 346:1070-1071 (1995); Ou et 

25 al, Am. J. Med. Genet, 63:610-614 (1996); Chatkupt et al, Am. Acad. Neurol Works 
in Progres, WIP4: (1996)] suggest that a genetic susceptibility factor for spina bifida 
is a common allele of the folate gene, MTHFR, the nucleotide 677C->T transition 
converting an alanine residue to valine resulting in a heat-labile enzyme protein. 
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Homozygotes for this allele, about 10% of the normal population, have lower 
erythrocyte folate and plasma folate during pregnancy [Molloy et al, Lancet, 
349:1591-1593 (1997)]. Homozygotes for this allele also develop moderately 
elevated blood homocysteine [van der Put et al, Lancet, 346:1070-1071 (1995); 
5 Frosst et al, Nature Genet, 10:1 1 1-1 13 (1995)] in the presence of dietary folate 
deficiency. Moderate hyperhomocysteinemia is toxic to adults [Fermo et al, Ann. 
Intern. Med., 123:747-753 (1995)], and toxic to the fetus in early gestation [Wouters 
et al, Fertil Steril, 60:820-825 (1993)], and possibly teratogenic in the first trimester 
causing neural tube defects [Whitehead et al, Q. J. Med., 88:763-766 (1995); van der 

10 Put et al, Lancet, 346:1070-1071 (1995); Ou et al, Am. J. Med. Genet, 63:610-614 
(1996). Thus, the MTHFR heat-labile mutation, in the presence of decreased dietary 
folate in midtrimester, could be teratogenic both through hyperhomocysteinemia and 
also through folate deficiency causing the developmental brain damage hypothesized 
in the developmental model of schizophrenia [Brixey et al, J. Clin. Psychol, 49:447- 

15 456 (1993)]. A second common polymorphism of MTHFR, the ntl298 A->C 
mutation could also be a genetic risk factor for spina bifida [van der Put et al, 
Lancet, 346:1070-1071 (1995]. 

Schizophrenia is a common disorder, affecting 1% or more of the population [Karno 
et al, In: Comprehensive Textbook of Psychiatry /VI ^ 6th ed., Baltimore: Williams & 

20 Wilkins, pp. 902-910 (1995)]. Thus, if a significant proportion of schizophrenia 
shares a common etiology, both the genetic susceptibility factors and the 
environmental factors must be common in the population. As mentioned earlier, a 
significant fraction of the population is already sub-clinically deficient for folate and 
for cobalamin; also, pregnancy may increase this fraction since dietary folate and 

25 cobalamin requirements increase during that time. Several functional polymorphic 
alleles of folate and cobalamin genes are also common in the population including the 
MTHFR mutations just mentioned and polymorphisms of thymidylate synthase 
[Horie et al, Cell Struct Funct, 20:191-197 (1995)], transcobalamin II [Li et al, 
Biochim. Biophys. Acta., 1219:515-520 (1994)], and folate-binding proteins [Li et al, 
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1994, supra; Shen et al, Biochem., 33:1209-1215 (1994)]. Metabolic indicators of 
folate or cobalamin deficiency, e.g. hyperhomocysteinemia and 
hypermethylmalonicacidemia, are also common in the population [Naurath et al, 
Lancet., 346:85-89 (1995)]. Thus there exists a statistical basis for the hypothesis 
5 that schizophrenia is a birth defect resulting from the action during gestation of 

genetic risk factors and environmental factors related to folate and/or cobalamin that 
lead to the generation of risk factors. Such factors are sufficiently common that at 
least in principle all cases of schizophrenia could result from this mechanism. 

yi Finally, folate, cobalamin, and pyridoxine are relevant for schizophrenia because of 

10 findings in patients. Severe genetic deficiency of MTHFR may cause a 

Hi "schizophrenia" phenotype [Freeman et al, N. Engl J. Med., 292:491-496 (1975); 

£p 

m Regland et al, L Neural Transm. Gen. Sect, 98:143-152 (1994)]. Genetic deficiency 

!L of other folate and cobalamin enzymes has been reported to cause nervous system 

Ul disease, psychiatric disease, or schizophrenia-like illness [Mudd et al, In: The 

Hi 

1 - I 15 Metabolic and Molecular Bases of Inherited Disease, Scriver et al (eds), New York: 

2 McGraw-Hill pp. 1279-1327 (1995); Hitzig et al, Ciba. Found. Symp., 68:77-91 
(1978); Cooper and Rosenblatt, Annu. Rev. Nutr., 7:291-320 (1987); Shevall and 
Rosenblatt, Can. J. Neurol. Set, 19:472-486 (1992); Hall, Br. J. Haematol, 80:117- 
120 (1992)]. Likewise, dietary deficiencies of folate or cobalamin may have similar 

20 effects [Cooper and Rosenblatt, Annu. Rev. Nutr., 7:291-320 (1987); Shevall and 

Rosenblatt, Can. J. Neurol Scl, 19:472-486 (1992)]. Methylfolate therapy reportedly 
improved the clinical status of schizophrenics with borderline or definite folate 
deficiency [Godfrey et al, Lancet, 2:392-395 (1990); Procter, Br. J. Psychiatry, 
159:271-272 (1991)] although the improvement claimed was small and the finding 

25 controversial. Folate deficiency has been associated with disturbances in mood 
[Shulman, In: Folic Acid in Neurology, Psychiatry, and Internal Medicine, New 
York: Raven Pr., 463-474 (1979)], and it has been suggested that the most common 
neuropsychiatric system abnormality in severe folate deficiency is depression 
[Reynolds et al, Lancet, ii: 196-198 (1984)]. Methyltetrahydrofolate reportedly 
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improved symptoms of depression in an open trial in elderly depressed patients 
[Guaraldi et al. Ann. Clin. Psychiatry 5:101-105 (1993)]. Schizophrenics are reported 
to have an 80% excess mortality from cardiovascular disease [Gottesman, 
Schizophrenia Genesis, Schizophrenia Genesis- The Origins of Madness, W.H. 
5 Freeman & Co. N.Y.(1991)]; hyperhomocysteinemia, dietary folate deficiency and 
the MTHFR 677C->T mutation have been implicated in cardiovascular disease in 
some studies [Morita et aL, Circulation, 95:2032-2036 (1997)] but not others 
(Anderson et al, J. Am. Coll Cardiol. 30:1206-1211 (1997)]. Also, kynureninase, an 
jj; important enzyme of tryptophan metabolism, affecting niacin metabolism and 

^ j 10 serotonin synthesis, is pyridoxine-dependent. Niacin deficiency (pellagra) can cause 

Si mental changes including psychosis and hallucinations [Wilson, Vitamin deficiency 

m and excess, pp. 472-480. In: Harrison 's Principles of Internal Medicine, (Scriber et al. 

P 1 e's.) McGraw-Hill, Inc., N.Y. (1994)]. Also, clozapine, resperidone, and olanzapine 

□ are thought to exert their antipsychotic effect in schizophrenia in part through 

Hi 

S ; 15 serotonin receptor antagonism. 

yj 

rj Gene Localization Studies in Schizophrenia and Folate/ Cobalamine/Pyridoxine 

Genes: If folate, cobalamin, or pyridoxine genes are susceptibility factors for 
schizophrenia, it is possible that gene localization studies have already identified 
candidate chromosome regions that contain such a gene (Tables 3,5, and 7). For three 
20 folate or cobalamin genes, DHFR, TCNII and TYMS, there is excellent concordance 
with schizophrenia gene localization studies. 



On chromosome 5, DHFR has been located at 5ql 1 .2-13.2. A schizophrenia 
translocation [t(l;5)(lq32.3;5ql 1.2-13.3)] was reported [McGillivray et al., Am. J. 
Med. Genet., 35:10-13 (1990); Bassett, Br. J. Psychiatry, 161:323-334 (1992)] 
25 affecting 5ql 1 .2-5ql3.3. A proband and uncle, both with schizophrenia and 

eye-tracking abnormalities, had partial trisomy for 5ql 1.2-5ql3.3; the third copy was 
inserted at lq32.3 giving a derivative chromosome, der(l)inv 

ins(l;5)(q32.2;ql3.3ql 1.2). The proband's mother had a balanced translocation but 
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was phenotypically normal without schizophrenia or eye-tracking abnormalities. She 
had the derivative chromosome 1 with extra material from chromosome 5 inserted but 
a corresponding deletion in one of her chromosomes 5. She thus had only two copies 
of 5ql 1.2-5ql3.3. Further studies [Gilliam et ah, Genomics, 5:940-944 (1989)] 
5 showed that the DHFR gene is located within this deleted region, 5ql 1 .2-1 3.3. 

Another schizophrenia chromosome abnormality, inv5(pl3;ql3), has been reported 
[Bassett, Br. J. Psychiatry, 161:323-334 (1992)] affecting 5ql3. 

On chromosome 5, two-point lod scores of 4.64 and 2.29 were found [Sherrington et 
ah, Nature, 336: 164-167 (1988)] for the polymorphic markers D5S76 and D5S39 
respectively in the region of the chromosome abnormality just discussed [McGillivray 
et al t Am. J. Med. Genet., 35:10-13 (1990); Bassett, Br. J. Psychiatry, 161:323-334 
(1992)] affecting 5ql 1.2-13.3. Two other linkage studies found small positive lod 
scores in this region [Coon et ah, Biol. Psychiatry, 34:277-289 (1993); Kendler and 
Diehl, Schizophr. Bull., 19:261-285 (1993)], but numerous other studies excluded this 
region under the assumptions and models used [Kendler and Diehl, Schizophr. Bull., 
19:261-285 (1993)]. 

On chromosome 18, TYMS has been located at 18pll.32-pll.22. A ring 
chromosome with deletion of 18pter-pl 1 ,18q23-qter [Bassett, Br. J. Psychiatry, 
161:323-334 (1992)] was reported in a kindred with schizophrenia and bipolar illness 
20 [Bassett, Br. J. Psychiatry, 161 :323-334 (1992)]. Deletion of a segment of 18p was 
reported in a schizophrenia chromosome [Bassett, Br. J. Psychiatry, 161:323-334 
(1992)]. 

On chromosome 22, TCNII has been located at 22qll.2-ql3, possibly at the 22ql2/13 
border. High lod scores have consistently been obtained in the region of TCNII: 
25 IL2RB, in 22ql2-ql3.1 gave a lod score [Pulver et ah, Am. J. Med. Genet., 54:3-43 
(1994)] of 2.82. Other markers over a broad region of 22q have given suggestive lod 
scores. D22S278, in 22ql2, gave a lod score [Vallada et ah, Am. J. Med. Genet., 
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60:139-146 (1995)] of 1.51. CRYB2, in 22qll.2-ql2.1, gave alod score [Lasseter et 
al, Am. J. Med. Genet., 60:172-173 (1995)] of 1.71. D22S10, in 22ql 1.1-ql 1.2, gave 
a lod score [Coon et al, Biol. Psychiatry, 34:277-289 (1993)] of 0.79. Highly 
significant p-values for non-parametric analyses have also been obtained: D22S278, 
5 in 22ql2, for example gave p= 001 [Gill et al, Am. J. Med. Genet., 67:40-45 (1996)]. 

The deletions of velocardiofacial (VCF) syndrome and related disorders (DiGeorge 
syndrome (DGS) and CATCH22) are located [Lindsay et al, Genomics, 32:104-112 
(1996)] at 22ql 1.2. A psychotic disorder develops in about 10% of patients with 
VCF syndrome [Chow et al, Am. J. Med. Genet, 54:107-112 (1994)]. TCNII is not 

10 known to be located at or within these deletions. VCF and related disorders are 

relatively uncommon compared to schizophrenia; only 2 of 100 randomly selected 
patients (92 schizophrenics, 5 with schizoaffective disorder, and 3 with 
schizophreniform disorder) in the Maryland Epidemiological Sample were found 
[Lindsay et al, Am. J. Hum. Genet., 56:1502-1503 (1995)] to have VCF-related 

1 5 deletions (and later VCF syndrome) on 22ql 1 .2. Consequently, it is not clear 
whether schizophrenia linkage studies are detecting a haplotype related to a VCS 
locus or some other locus in this region, such as TCNII. 

For some other folate, cobalamin, or pyridoxine relevant genes, physical or genetic 
studies of schizophrenia have identified chromosomal regions near the gene. 

20 DISCUSSION 

The folate-cobalamin hypothesis for schizophrenia is attractive because it suggests 
that a single mechanism of genetic and environmental factors may play a major role 
in the etiology and pathogenesis of schizophrenia. The combined result of this 
mechanism is to damage fetal development, especially brain development by 

25 inhibiting nucleic acid synthesis, by affecting gene methylations, by increasing 
susceptibility to infection, and/or by producing teratogens. 
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This mechanism addresses several puzzling features of schizophrenia such as the 
season of birth effect, the association with famine and influenza epidemics, the 
negative association with rheumatoid arthritis, the associations with obstetrical 
abnormalities, social class, and urban environment. The mechanism also suggests 
5 approaches to diagnostic testing, to prevention, and to improved therapy. 



It is not excluded that such a mechanism could also apply to a number of common 
human developmental disorders that have been shown to have a genetic component to 
n their etiology but whose mode of inheritance has been difficult to determine and for 

which linkage studies have met with unexpected difficulties or have achieved limited 
% i 10 success. These developmental disorders include Tourette's syndrome & related 

ry disorders (e.g. obsessive-compulsive disorder and chronic multiple tics syndrome) 

T [Pauls, Adv Neurol 58:151-157 (1992); McMahon et al, Adv Neurol 58:159-165 

y s 

I (1992); Heutink et al, Am J Hum Genet, 57:465-473 (1995); Grice et al, Am J Hum 

yS Genet, 59:644-652 (1996)], learning disorders, including dyslexia [Lewis, et al, 

j 1 ^ 15 Behav Genet, 23:291-297 (1993); Pennington, J Child Neurol 10 Suppl, 1:S69-S77 

(1995)], conduct disorder [Lombroso et al, J. Am. Acad, Child Adolesc. Psychiatry, 
33:921-938 (1994)], attention-deficit hyperactivity disorder [Lombroso et al, 1994, J. 
Am. Acad. Child Adolesc. Psychiatry, 33:921-938 (1994)], bipolar illness [Baron, 
Acta. Psychiatr. Scand., 92:81-86 (1995); Benjamin and Gershon, Biol Psychiatry, 
20 40:313-316 (1996); Risch and Botstein, Nature Genet., 12:351-353 (1996); Jamison 
and Mclnnis, Nature Med., 2:521-522 (1996); Morell, Science, 272:31-32 (1996)], 
autism [Lombroso et al, 1994,/. Am. Acad. Child Adolesc. Psychiatry, 33:921-938 
(1994)], and obsessive-compulsive disorder in adults [Lombroso et al, 1994, J. Am. 
Acad. Child Adolesc. Psychiatry, 33:921-938 (1994)]. Some of these disorders have 
25 been shown to be associated with schizophrenia. 



The present invention may be better understood by reference to the following non- 
limiting Examples, which are provided as exemplary of the invention. The following 
Examples are presented in order to more fully illustrate one embodiment of the 
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invention. They should in no way be construed, however, as limiting the broad scope 
of the invention. 

EXAMPLE 1 

DIAGNOSING SCHIZOPHRENIA 

5 Structure of Datafiles 

Data are arranged in a file suitable for input into a binary logistic regression program 
01 (Table 8). A model is created consisting of those explanatory variables actually 

u s 

%J available from the specific patient-to-be-diagnosed and family members participating 

HJ in the testing. This new combined data set (reference data set + data from 

pi 10 patient-to-be-diagnosed with participating family members) is analyzed by binary 

L logistic regression for the model chosen giving the predicted probability that a 
proband is affected with schizophrenia for all of the probands including the 

i y 

yj patient-to-be-diagnosed. 

The model can be modified if required. The goodness of fit for the 
15 patient- to-be-diagnosed is checked. The predicted probability that the 

patient-to-be-diagnosed has schizophrenia is compared with a classification table 
generated from the model used to determine likelihood of false positives and false 
negatives. The predicted probability that the patient-to-be-diagnosed is affected with 
schizophrenia, with likelihood of false positive or false negative result, is returned to 
20 the clinician. 
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TABLE 8 

A HYPOTHETICAL PARTIAL REFERENCE DATA SET OF GENETIC 
EXPLANATORY VARIABLES TO ILLUSTRATE DATA STRUCTURE 





ID 


resp 


n 1 1 1 

Pill 


rl 12 


F21 1 


P212 


Ml 1 1 


Ml 12 


Mil 1 


T?C 1 1 

r51 1 


C"^ A 1 1 

b2-41 1 






1 




1 

1 


0 


1 


1 


1 


1 


0 


0 


1 


1 

1 




2 


! 


1 


0 


0 


0 


0 


0 


0 


1 


0 


0 




3 




1 


1 


1 


0 


1 


0 


0 


1 


1 


1 




4 




0 


0 


0 


0 


0 


0 


1 


0 


0 


0 




5 




0 


0 


1 


1 


1 


1 


0 


0 


0 


1 


y e 


6 


0 


1 


0 


0 


0 


0 


0 


0 


0 


0 


0 




7 


0 


0 


0 


0 


0 


0 


0 


1 


0 


0 


0 


njj 
CP 


8 


0 


0 


0 


1 


0 


0 


0 


0 


1 


1 


0 




9 


0 


1 


0 


0 


0 


1 


0 


0 


0 


1 


1 




10 


0 


0 


0 


0 


0 


1 


0 


0 


0 


1 


0 



For each proband (Table 8), the record contains several variables: 
identification number (ID) of the proband. 

a binary response variable (resp) for affection status of the proband: response=l, if the 
proband is affected with schizophrenia; response=0 if proband is unaffected (i.e. a control individual). 
The proband is not necessarily one of the individuals for whom genotype data (explanatory variables) 
are available. The patient-to-be-diagnosed is assigned response=0 when added to the reference data 
set. 

a set of explanatory variables: i.e. sets of genotypes of mutations found in the schizophrenia 
patients and family members and controls and family members. The schizophrenia patients and the 
control individuals are probands (P) as is the patient-to-be-diagnosed. Unaffected family members are 
the proband's mother (M), father (F), sib(s) (SI, S2, etc.), child(ren) (CI, C2, etc.) or other relatives. 
Data for affected family members, e.g. the proband's mother (MA), father (FA), sibs (SA1, SA2, etc.), 
children (CA1, CA2, etc.), or other relatives, are entered as separate explanatory variables. 
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Genetic explanatory variables: Each individual has 0, 1, or 2 copies of any given 
mutation allele at a given locus. Thus a genotype at each locus contributes two 
independent explanatory variables. Most of the affected family members will be 
relatives of schizophrenia probands, but occasionally a relative of an unaffected 
5 proband will turn out to be affected with schizophrenia. 

Mutations are tabulated as explanatory variables: {see Table 8): 

(i) by the proband or relative in whom they occur, (e.g. P, M, F, S2, CI, MA, FA, 
SA1, CA1, other); 

(ii) by the specific folate, cobalamin, or pyridoxine gene locus in which they 
occur (e.g. 1=DHFR locus, 2=MTHFR locus, 3=TCN2 locus, 4-MTR locus, 5=CBS 
locus, etc.); 

(iii) by the specific mutation within a locus (e.g., l=the first-designated mutation 
within a locus, 2=the second-designated mutation within a locus, etc.); and 

(iv) by whether the individual has a single or double dose of the mutation. Thus 
an explanatory variable P321 records whether the proband has a single dose of the 
second-designated mutation of the third-designated locus, i.e. TCN2. A variable 
M312 records whether the proband's mother has a double dose of the first-designated 
TCN2 mutation studied. 

In the present hypothetical reference dataset illustrated of genetic explanatory 
20 variables (Table 8), partial genotype data for probands, mothers, fathers, sibs and 

children are given for five gene loci. Not all of the possible explanatory variables are 
shown. Probands 1-5 are unrelated individuals with the definite clinical diagnosis of 
schizophrenia; probands 6-10 are unrelated unaffected (control) individuals. 
Probands 1, 2, 3, 6 and 9 all have a single copy of the first-designated DHFR 
25 mutation; proband 3 also has a second copy of that mutation. Probands 1, 3, 5 and 8 
all have a single copy of the first-designated mutation at the MTHFR locus; probands 
1 and 5 also have a second copy of that mutation. Mothers of probands 1, 3, 5, 9 and 
10 all have a single copy of the first-designated DHFR mutation; mothers of probands 



ill 



10 



15 
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1 and 5 also have a second copy of this mutation. Mothers of probands 4 and 7 each 
have a single copy of the first-designated mutation of TCN2; data for a double dose 
are not shown. The fathers of probands 2, 3, and 8 each have a single copy of the first 
designated mutation of CBS; data for a double dose are not shown. The second 
5 (unaffected) sibs of probands 1, 3, 8, 9, and 10 each have a single copy of the 

first-designated mutation of MTR; data for a double dose are not shown. The first 
affected children of probands 1, 3, 5, and 9 each have a single copy of the 
first-designated mutation of DHFR. Other susceptibility loci and mutations can be 
incorporated in Table 8 in the same fashion e.g., cytokine gene mutations or 

€i 10 polymorphisms, or major histocompatibility complex (MHC) mutations or 

%i polymorphisms. 

pi Environmental explanatory variables: If only genetic explanatory variables 

(genotype data) are used, the maximum predicted probability that the proband is 
^ affected with schizophrenia is expected to be approximately about 0.5 in most 

yi 

rU 15 populations. When environmental risk factors are included as explanatory variables, 

pi the maximum predicted probability that the proband is affected with schizophrenia 

W may approach 1.0. Examples of environmental risk factors for a schizophrenia patient 

include: 

(1) the proband's dietary folate/cobalamin/pyridoxine intake. 
20 (2) the proband's circulating levels of folate/cobalamin/pyridoxine. 

(3) the proband's circulating levels of homocysteine, methylmalonic acid, or 
cystathionine. Elevated levels are indicators of subtle folate/cobalamin deficiency. 

(4) the proband's mother's dietary folate/cobalamin/pyridoxine intake at the time 
of patient diagnosis, during a pregnancy, or during the pregnancy that produced the 

25 proband. 

(5) the proband's mother's circulating levels of homocysteine, methylmalonic 
acid, or cystathionine at the time of patient diagnosis, during a pregnancy, or during 
the pregnancy that produced the proband. 
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(6) dietary or circulating folate/cobalamin/pyridoxine or circulating levels of 
homocysteine, methylmalonic acid, or cystathionine for other family members. 

(7) epidemiological factors related to the proband's gestation and birth, e.g. low 
birth weight or preterm birth, maternal infection, maternal smoking (associated with 

5 low plasma folate), season of birth (late winter or spring births are more common in 
schizophrenia), etc. 



-a- 



Method of Data Analysis 
The method exemplified herein is based upon the published guide for the SAS 
system, but other software can be used. The dataset is analyzed using binary logistic 
10 regression to model the response probability, p i5 that the ith proband's affection status 
is 1, i.e. the probability that the ith proband has schizophrenia, given the vector of 
sj explanatory variables, x { . That is: 

^ p i = Prob(y i =l|x i ). 

EP To do this the logit transformation of p 1 is modeled as a linear function of the 

p 15 explanatory variables in the vector, x^ 

W j logit (pO = log (p/[ 1 -Pi]) = alpha + beta'X; 

yj where: alpha is the intercept parameter and 

m beta is the vector of slope parameters. 

In SAS, the "descending' 1 option is used to model the probability that the response=l, 
20 as in the present analysis, rather than response=0. 



Outputs of binary logistic regression analysis 
After analysis of a dataset, the outputs obtained from SAS include: 

(a) Estimates and standard errors of the parameters (alpha and beta). 
Using estimates of the intercept parameter (alpha) and the slope parameter (beta) for 

25 each environmental or genetic risk factor, the logistic regression equation for the 
dataset can be written. 

(b) Significance tests of the parameters (e.g. Wald chi-square). From the 
corresponding p-values, the level of significance of each of the environmental or 
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genetic risk factors is determined. A global significance test of the data with 
corresponding p-value is also determined. 

(c) Odds ratios are given for the slope parameters of each environmental 
or genetic risk factor. Thus the amount contributed by each environmental or genetic 

5 risk factor to the risk of schizophrenia is determined. 

(d) The confidence limits for regression parameters and odds ratios are 
determined. 

(e) The predicted probabilities of the observations can be computed, i.e. 
rj the probability that each individual in the dataset has schizophrenia: 

ili 10 alpha- = estimate of the intercept parameter; 

beta- = vector of the estimates of the slope parameters; 
flj x = vector of the explanatory variables; 

p~ = predicted probabilities 
1 



y 



1 5 r)— = 

p 1 + exp(alpha~ - beta-'x) 



n j (f) The model is modified by adding or removing variables until a model 



is found that best fits the data; 

(g) The model is tested for goodness-of-fit. Also, the degree of influence 
20 of each specific observation is tested to detect extreme or ill-fitting observations. 

These may be examples of data entry errors or alternatively, observations that do not 
fit the present model for schizophrenia. 

(h) The probability that a new individual (the patient-to-be-diagnosed) is 
schizophrenic is then calculated from the final, modified, best fitting regression 

25 equation based upon parameters derived from a corrected/modified data set. A simple 
method of doing this is to add the data for the patient-to-be-diagnosed to the reference 
data set, a large group of well-studied schizophrenia probands, schizophrenia family 
members, control probands and control family members for whom data are available 
for many explanatory variables. A model is created consisting of those informative 

30 explanatory variables actually available from the specific patient-to-be-diagnosed and 
family members participating in the testing. This new combined data set (reference 
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data set + data from patient-to-be-diagnosed with participating family members) is 
analyzed by binary logistic regression for the model chosen giving the predicted 
probability that a proband is affected with schizophrenia for all of the probands 
including the patient-to-be-diagnosed. 
5 (i) A classification table is produced from the data set by the "jack 

knifing" procedure or an approximation to it. This procedure classifies each 
observation as an event or nonevent based on the model that omits the observation 
being classified. A classification table sorts observations into percent correct, percent 
false positives, and percent false negatives at various probability levels and computes 
1 0 sensitivity and specificity. 

(j) The data set used for diagnostic testing is constantly being updated 
and the regression equation corrected. For example, stratification by geographic 
residence or geographic origin of ancestors must be considered for some 
environmental or genetic risk factor. 

15 For example, in Table 9, entries 34-43 are shown for the data file containing 

genotypes of 38 schizophrenic probands plus 211 control probands; the first 38 are 
the affected probands. For individual 302088, the proband is affected ("1"); there is a 
single dose ("1") of the DHFR mutation but not a double dose ("0") and a single dose 
("1") of the MTHFR mutation but not a double dose ("0"). The number 302088 

20 identifies the individual whose genotypes are listed; the proband, in this case, is the 
same individual. 
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TABLE 9 

SAS DATAFILE FOR SCHIZOPHRENIA PATIENTS AND CONTROLS 



5 


34 


302086 


1 


1 


0 


1 


1 




35 


302088 


1 


1 


0 


1 


0 




36 


302110 


1 


1 


0 


1 


0 




37 


302111 


1 


1 


0 


0 


0 




38 


302136 


1 


1 


1 


1 


0 


Ul 10 


39 


100001 


0 


1 


0 


0 


0 




40 


100061 


0 


0 


0 


0 


0 


ni 


41 


100064 


0 


1 


0 


1 


0 


fn 


42 


100067 


0 


0 


0 


1 


0 


s 


43 


100073 


0 


1 


0 


0 


0 



Ul 15 

ni 



In Table 10, entries 31-40 are shown for the data file containing genotypes of 35 
20 mothers of schizophrenic probands plus (the same) 211 control probands. For 

individual 302083, the proband is affected ("1"); there is a single dose of the DHFR 
mutation ("1) but not a double dose ("0"); there is neither a single ("0") nor a double 
("0") dose of the MTHFR mutation. The number 302083 identifies the individual 
whose genotypes are listed, a mother; the proband, in this case, is a different 
25 individual, her affected child. 



# 
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TABLE 10 

SAS DATAFILE FOR SCHIZOPHRENIA MOTHERS AND CONTROLS 



5 


31 


302083 


1 


1 


0 


0 


0 




32 


302103 


1 


0 


0 


1 


0 




33 


302104 


1 


0 


0 


1 


0 




34 


302105 


1 


1 


0 


1 


0 




35 


302120 


1 


0 


0 


0 


0 


Si 10 


36 


100001 


0 


1 


0 


0 


0 




37 


100061 


0 


0 


0 


0 


0 




38 


100064 


0 


1 


0 


1 


0 


E 


39 


100067 


0 


0 


0 


1 


0 




40 


100073 


0 


1 


0 


0 


0 



In Table 11, entries 1 1-20 are shown for the data file containing genotypes of 15 
fathers of schizophrenic probands plus (the same) 211 control probands. For 
20 individual 302084, the proband is affected ("1"); there is a single dose ("1") but not a 
double dose ("0") of the DHFR mutation; there is both a single ("1") and a double 
dose ("1") of the MTHFR mutation. The number 302084 identifies the individual 
whose genotypes are listed, a father; the proband, in this case, is a different 
individual, his affected child. 
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TABLE 1 1 



SAS DATAFILE FOR SCHIZOPHRENIA FATHERS AND CONTROLS 



11 


302102 


1 


0 


0 


0 


0 


12 


302106 


1 


1 


0 


0 


0 


13 


302115 


1 


1 


0 


0 


0 


14 


302117 


1 


1 


0 


0 


0 


15 


302084 


1 


1 


0 


1 


1 


16 


100001 


0 


1 


0 


0 


0 


17 


100061 


0 


0 


0 


0 


0 


18 


100064 


0 


1 


0 


1 


0 


19 


100067 


0 


0 


0 


1 


0 


20 


100073 


0 


1 


0 


0 


0 



In Table 12, entries 9-18 are shown for the data file containing genotypes of 13 
unaffected sibs of schizophrenic probands plus (the same) 211 control probands. For 
20 individual 302089, the proband is affected ("1"); there is a single dose ("1") but not a 
double dose ("0") of the DHFR mutation; there is both a single ("1") and a double 
dose ("1") of the MTHFR mutation. The number 302089 identifies the individual 
whose genotypes are listed, an unaffected sib; the proband, in this case, is a different 
individual, the affected sib of individual 302089. 
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TABLE 12 

SAS DATAFILE FOR SCHIZOPHRENIA SIBS AND CONTROLS 

5 



09 


302071 




1 


1 


0 


0 


10 


302073 


1 


0 


0 


1 


0 


11 


302089 


1 


1 


0 


1 


1 


12 


302118 


1 


1 


0 


0 


0 


13 


302126 


1 


1 


0 


0 


0 


14 


100001 


0 


1 


0 


0 


0 


15 


100061 


0 


0 


0 


0 


0 


16 


100064 


0 


1 


0 


1 


0 


17 


100067 


0 


0 


0 


1 


0 


18 


100073 


0 


1 


0 


0 


0 



In Tables 9-12 for individual 100061, the proband is unaffected ( ff 0 M ); there is neither 
a single dose ("0") nor a double dose ("0") of the DHFR mutation; there is neither a 
20 single dose ("0") nor a double dose ("0") of the MTHFR mutation. Since the proband 
is unaffected, this is a control individual. The number 100061 identifies the individual 
whose genotypes are listed, as a control individual; the proband, in this case, is the 
same individual. The identical group of control individuals is used for all four 
comparisons. 



# 



in 
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EXAMPLE 2 

Distribution of Folate Gene Polymorphism Genotypes Among Schizophrenics, 
Schizophrenia Parents, Schizophrenia Sibs, and Controls 

Summary 

5 The DNA polymorphism-Diet-Cofactor-Development hypothesis (DDCD hypothesis, 
described above) postulates that schizophrenia results in part from developmental 
brain damage sustained in utero from the aggregate effect of maternal defects of 
genes related to important cofactors, e.g. folate, cobalamin, pyridoxine, potentiated by 
a maternal dietary deficiency of these cofactors. The maternal damage to the fetus 
1 0 results in part from insufficiency of these cofactors themselves and in part from 
resulting effects such as immune deficiency and maternal teratogens, e.g. 
hyperhomocysteinemia. Genes from either parent acting in the fetus may modify 
these damaging effects as outlined in the gene-teratogen model (described above). 

The hypothesis addresses all of the unusual biological and epidemiological features of 
15 schizophrenia: e.g. the decreased amount of grey matter in brain areas, the unusual 
birth-month effect, the geographical differences in incidence, the socioeconomic 
predilection, the association with obstetrical abnormalities (low birth weight and 
prematurity), the decreased incidence of rheumatoid arthritis, and the association with 
viral epidemics (described above). 

20 The hypothesis can be supported by finding significant association of sequence 
variants of folate, cobalamin, or pyridoxine genes with schizophrenia. Folate, 
cobalamin, and pyridoxine absorption, transport, and metabolism are complex 
[Rosenblatt, In: The Metabolic and Molecular Bases of Inherited Disease, Scriver et 
al. (eds), New York: McGraw-Hill, pp. 31 1 1-3128 (1995); Benton and Rosenberg, In: 

25 The Metabolic and Molecular Bases of Inherited Disease, Scriver et al. (eds), New 
York: McGraw-Hill, pp. 3129-3149 (1995); Whyte et al, Hypophosphatasia, In: The 
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Metabolic and Molecular Bases of Inherited Disease, Scriver et al. (eds), New York: 
McGraw-Hill pp. 4095-41 11] with multiple transport proteins, enzymes, and 
regulatory components. A strong candidate for harboring a mutation predisposing to 
schizophrenia is the DHFR gene coding for the folate enzyme dihydro folate 
5 reductase. DHFR chemically reduces dietary folate converting it into a form that can 
enter cellular metabolism. DHFR is also important for DNA synthesis and is known 
to play a major role in development in utero. A novel polymorphic 19 basepair 
deletion of the DHFR gene has been isolated which could be of functional 
significance because it affects potential transcription factor binding sites. 

A second candidate is the MTHFR gene, coding for methylenetetrahydrofolate 
reductase, MTHFR, an important enzyme of folate metabolism. MTHFR was of 
particular interest because severe deficiency of enzyme activity has been associated 
with the "schizophrenia" phenotype [Freeman et al, N. Engl. J. Med., 292:491-496 
(1975); Regland et al., J. Neural Transm. Gen. Sect, 98:143-152 (1994)] and because 
a common mutation, the nt677 C->T transition results in a mutated gene that encodes 
a heat-labile MTHFR, having decreased enzymatic activity, which in the presence of 
dietary folate deficiency, causes the plasma homocysteine of homozygotes to become 
elevated [van der Put et al, Lancet, 346:1070-1071 (1995); Frosst et al, Nature 
Genet, 10:111-113 (1995)]. In adults, hyperhomocysteinemia is known to cause 
vascular disease and to be toxic [Frosst et al, Nature Genet., 10:111-113 (1995)]. 
Therefore, homocysteine that crosses the placenta could act as a fetal teratogen during 
pregnancy. Maternal folate deficiency could also have a more direct teratogenic 
effect through fetal folate deprivation. These effects could be potentiated by 
abnormalities of other folate, cobalamin, or pyridoxine genes, even if these 
abnormalities were only minor. 

Materials & Methods : 
1. Subjects and Sample Collection: Patients with schizophrenia and unaffected family 
members of schizophrenics, were ascertained from patient facilities, patient support 



10 



15 



20 



25 
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groups, and family support group organizations. Nearly all schizophrenia families 
had only a single case of schizophrenia. The patients came from different 
schizophrenia families than the parents and sibs. The controls were unaffected and 
unrelated individuals not known to be schizophrenic or related to patients with 
5 schizophrenia or spina bifida. All subjects were of Caucasian background except two 
of the schizophrenia patients who were of African American background. 

After informed consent was obtained, 20-40 ml of blood was collected into EDTA 
(purple-top) vacutainers, placed on ice immediately, and transported to the laboratory 
where plasma, packed red cells, and buffy coat were separated by centrifugation and 
frozen at -80°C. 

2. Detection of Alleles: DNA was isolated using the QIAmp column DNA extraction 
procedure or the QIAGEN Genomic-tip method (QIAGEN, Chats worth, CA). Alleles 
for a newly detected polymorphic 19 bp deletion in the dihydro folate reductase 
(DHFR) gene were determined by polymerase chain reaction (PCR) amplification of 
the region surrounding the deletion using specific primers (Fig 1) and direct detection 
of the PCR products after separation of products on a non-denaturing polyacrylamide 
gel. A Cetus - Perkin-Elmer 9600 thermocycler was used. Briefly, the PCR reaction 
contained 200 uM dNTPs, 1.5 mM MgCl 2 , 10 pmols of each primer, in 10 ul reaction 
volume. The PCR conditions used were denaturation at 94°C for 6 min. initially, 
followed by 35 cycles of 94°C for 55 sec, 60°C for 55 sec, and 72°C for 55 sec and a 
final extension at 72°C for 12 min. 

Alleles for the 677C->T transition of the methylenetetrahydro folate reductase 
(MTHFR) gene were determined by cleavage with the restriction endonuclease, 
Hinfl, of PCR-amplified genomic DNA from blood and separation of the products by 
25 non-denaturing polyacrylamide gel electrophoresis [Frosst et ah, Nature Genet., 
10:111-113 (1995)]. 
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3. Sequencing the Region Around the DHFR Deletion : Using the same primers 
(Figure 1), genomic DNA from individuals with 1,1 and 2,2 genotypes was amplified 
by PCR and the products sequenced using an ABI PRISM 377 automated sequencer. 
Restriction sites were identified using the MAP Program in the GCG Package. 
Potential transcription factor binding sites were detected with the TESS program 
(transcription element search software, 
URL:http://agave.humgen.upenn.edu/tess/index.html). 

4. Data Analysis: Since the mode of inheritance of schizophrenia is unknown, binary 
logistic regression was used to test the DHFR deletion allele and the MTHFR 
heat-labile allele as genetic risk factors for schizophrenia. Either the DHFR deletion 
polymorphism or the MTHFR heat-labile allele could itself be a genetic risk factor for 
schizophrenia. The genotypes of the two folate gene polymorphisms were used as 
explanatory variables. Genotypes of schizophrenia patients, parents, or sibs were 
compared with those of controls. 

Four files were constructed consisting of schizophrenia patients+controls, mothers of 
schizophrenia patients+controls, fathers of schizophrenia patients+controls, and sibs 
of schizophrenia patients+controls for input into the SAS System. Each dataset 
contained 6 variables. In order, these were: 

1 . six digit identification (ID) number; 

2. response variable, i.e. affection status of the proband 
(0=unaffected, i.e. control individual; l=affected, i.e. schizophrenia patient); 

3. DHFR mutation-single dose (Ds); 

4. DHFR mutation-double dose (Dd); 

5. MTHFR mutation-single dose (Ms); and 

6. MTHFR mutation-double dose (Md). 

For mutation data, 0=mutation absent, l=mutation present. 
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Results 

Alleles of the DHFR 19 bp Deletion Polymorphism: Amplification of the region of 
intron 1 of DHFR defined by the primers in Figure 1 gave two polymorphic bands of 
232 and 213 bp after separation on a non-denaturing polyacrylamide gel (Figure 2). 
Sequencing the PCR products from the two homozygotes showed that they differed 
by 19 bp (Figure 3). The upper and lower bands (Figure 2), non-deletion allele and 
deletion allele respectively, were designated alleles 1 and 2 respectively. Comparison 
with two published sequences showed that allele 1 was identical with one of them 
[Yang et al J. Mol Biol 176:169-187 (1984)] indicating that allele 2 resulted from a 
19 bp deletion. The other published sequence [Chen et al J. Biol Chem. 259:3933- 
3943 (1984)] was lacking one base pair of allele 1, an A indicated by "*" in Fig 3. It 
is possible that this shorter reference sequence [Chen et al J. Biol Chem. 259:3933- 
3943 (1984)] resulted from a sequencing artifact. 

Sequences in the 19 bp Deleted Region of DHFR Intron 1: The 19bp sequence in the 
deleted region (Fig 3) of DHFR intron 1 contained sites for several restriction 
enzymes including Rsal and ScrFI, and potential binding sites for transcription factors 
including Spl, NF-kappaB, CP1 (NF-Y), E2F, ETF and GCF in the 19 base pair 
region. 

Binary Logistic Regression Analysis: The number of individuals with each genotype 
of the two polymorphisms among 38 unrelated schizophrenia probands, 35 unrelated 
mothers of schizophrenia probands, 15 unrelated fathers of schizophrenia probands, 
13 unrelated unaffected sibs of schizophrenia probands, and 211 unrelated unaffected 
control probands is shown in Table 13. 
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TABLE 13 

DISTRIBUTION OF DHFR AND MTHFR MUTATION GENOTYPES 
AND ALLELES AMONG CONTROLS. SCHIZOPHRENICS. 
AND SCHIZOPHRENIA FAMILY MEMBERS 



5 DHFR 1 9 bp deletion polymorphism : 

— GenTyp— Schizophrenia — Ctrl — 









P 


M 


F 


S 








1/1 


6 (.16) 


10 (.29) 


4 (.27) 


4 (.31) 


56 (.26) 


if* 




1/2 


22 (.58) 


13 (.37) 


11 (.73) 


8 (.61) 


115 (.54) 


Ul 


10 


2/2 


10 (.26) 


12 (.34) 


0 (0.0) 


1 (.08) 


40 (.19) 






total 


38 (1.00) 


35 (1.00) 


15 (1.00) 


13 (1.00) 


211 (.99) 


m 

01 






MTHFR 677C->T transition polymorphism: 
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1/1 


14 (.37) 


16 (.46) 


11 (.73) 


4 (.31) 


103 (.49) 






1/2 


18 (.47) 


18 (.51) 


3 (.20) 


8 (.61) 


78 (.37) 






2/2 


6 (.16) 


1 (.03) 


1 (.07) 


1 (.08) 


30 (.14) 






total 


38(1.00) 


35 (1.00) 


15(1.00) 


13 (1.00) 


211 (1.00) 



20 



P=schizophrenia patients; M=mothers of schizophrenia patients; F=fathers of 
schizophrenia patients; S=unaffected sibs of schizophrenia patients; Ctrl=control 
individuals. 
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The four data files were analyzed using the logistic procedure of SAS (SAS Institute 
Inc., 1995) and the "descending" option, which modeled the probability that 
RESPONSE=l, that is, the probability that the proband was affected with 
schizophrenia. Note that the proband was not always the individual whose genotype 
5 data were used. For example, genotype data for mothers of schizophrenic probands 
were used to determine the probability that their children, the probands, were affected 
Use of the "best" model selection options for logistic analysis in SAS gave the best 
models for two and three explanatory variables, (Table 14). 



92 
Table 14 

BINARY LOGISTIC REGRESSION RESULTS 



GENETIC RISK FACTOR MODEL: Ds Dd Ms Md 
Odds Ratio (p value) 

Schizophrenia Patients 

DsOR(p) 1.937 (.18) 

DdOR(p) 1.263 (.59) 

MsOR(p) 1.775 (.14) 

Md OR(p) 0.914 (.86) 



Mothers of Schizophrenia Patients 
DsOR(p) 0.630 (.31) 

Dd OR(p) 2.653 (.028)* 

MsOR(p) 1.439 (.34) 

MdOR(p) • 0.143 (.065) 



Fathers of Schizophrenia Patients 
DsOR(p) 1.178 (.79) 

Dd OR(p) 0.000 (.96) 

MsOR(p) 0.366 (.14) 

Md OR(p) 0.841 (.88) 



Unaffected Sibs of Schizophrenia Patients 



Ds OR(p) 
Dd OR(p) 
Ms OR(p) 
Md OR(p) 



1.104 (.88) 
0.337 (.31) 
2.688 (.12) 
0.317 (.29) 
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Notes For Table 14 

DHFR 19 bp deletion : Ds=single dose; Dd=double dose 

MTHFR 677C->T mutation : Ms=single dose; Md=double dose 

Logistic regression model: 

Model with four explanatory variables (Ms, Md, Ds and Dd). 

OR(p)=odds ratio and the corresponding p-value for that odds ratio 
determination *=significant at the p<.05 level. 

0.000 odds ratios occurred since none of the fathers of schizophrenia patients 
had genotype Dd; there was a possibly quasi- complete separation in the sample 
points; the maximum likelihood estimate may not exist; and therefore validity of the 
model fit for these odds ratios was questionable. 

The comparison of mothers of schizophrenia probands with control probands was 
statistically significant. Ds was not a significant genetic risk factor. Neither Ms nor 
Md in mothers was a significant genetic risk factor. However, the p-value for Md 
decreased and approached significance (p=.065) at the p<.05 level. 

Predicted Probabilities of the Various Genotypes : The "probs predicted" modality of 
SAS, gave the predicted probability that the proband was affected with schizophrenia 
(response=l) given genotype data for control probands and schizophrenia patients 
(probands), mothers of schizophrenia probands, fathers of schizophrenia probands, or 
sibs of schizophrenia probands. The maximum probabilities obtained are shown in 
Table 15. The highest maximum predicted probability that the proband was affected 
was obtained for genotype data from mothers of schizophrenia probands, next for 
schizophrenia probands, next for fathers of schizophrenia probands, and lowest for 
sibs of schizophrenia probands. 




94 

TABLE 15 

MAXIMUM PREDICTED PROBABILITY 



Model P M E S 

Ds Dd Ms Md 0.24 0.29 0.12 0.11 

Model and explanatory variables are the same as in Table 14. 
************************************************ 

Determination of Genotypes Conferring the Highest Risk: The predicted probabilities 
that the proband was affected with schizophrenia given specific genotypes of control 
probands and schizophrenia probands, mothers of schizophrenia probands, fathers of 
schizophrenia probands, or sibs of schizophrenia probands were determined using the 
5 model containing all four explanatory variables (Table 16). The predicted 

probabilities that the proband was affected with schizophrenia were highest for 
maternal genotypes (Table 15). The maternal genotype with the highest risk was Dd 
Ms, conferring a probability of 0.29 of schizophrenia in the proband (Table 16). The 
Dd Ms genotype also gave the highest predicted probability, 0.24, for schizophrenia 
10 patients. 
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TABLE 16 

PREDICTED PROBABILITIES FOR SPECIFIC GENOTYPES 



Genotype 

Schizophrenia Patients : 

Dnull + Mnull 

Dnull + Ms 

Dnull + Md 

Ds + Mnull 

Dd + Mnull 
Mothers of Schizophrenia Patients : 



Model: Ds Dd Ms 
Predicted 
Probability 

0.07 
0.12 
0.11 
0.12 
0.15 



Md 



Dnull + Mnull 0.16 

Dnull + Ms 0.20 

Dnull + Md 0.03 

Dd + Mnull 0.22 

Ds + Mnull 0.10 

Fathers of Schizophrenia Patients : 

Dnull + Mnull 0.10 

Dnull + Ms 0.04 

Dnull + Md 0.03 

Ds + Mnull 0.12 

Dd + Mnull 0.0 

Unaffected Sibs of Schizophrenia Patients : 

Dnull + Mnull 0.04 

Dnull + Ms 0.10 

Dnull + Md 0.03 

Ds + Mnull 0.04 

Dd + Mnull 0.02 



Genotype 



Ds + Ms 

Ds +Md 

Dd +Ms 

Dd +Md 



Ds +Ms 

Ds +Md 

Dd + Ms 

Dd + Md 



Ds +Ms 

Ds + Md 

Dd +Ms 

Dd + Md 



Ds +Ms 

Ds +Md 

Dd + Ms 

Dd +Md 



Predicted 
Probability 

0.20 
0.19 
0.24 
0.23 



0.13 
0.02 
0.29 
0.06 



0.05 
0.04 
0.0 
0.0 



0.11 
0.04 
0.04 
0.01 



Genotypes consist of the same explanatory variables described in Table 14 except that Dnull has no 
copy of the DHFR deletion and Mnull has no copy of the MTHFR 677C->T variant. Odds ratios of 
0.0 were unsatisfactory as described in Table 14. 
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Discussion 

Structure and Function of the DHFR 19 bp Deletion Polymorphism: DHFR 
polymorphisms have been reported previously [Feder et al. 9 NucL Acids Res. 15:5906 
(1987); Detera-Wadleigh et aL, NucL Acids Res. 17:6432 (1989)]. It is known that 
5 introns are important for message regulation e.g. , splicing, or as sites for binding 
transcription factors. Since the first intron is a relatively common location for 
regulatory elements, it is possible that the deleted region of DHFR intron 1 could play 
a role in regulation of DHFR or that the deletion could be a genetic risk factor for 
schizophrenia because it removes potential transcription factor binding sites. 
10 Abnormalities of transcription factors and their binding sites may play a role in 

disease. For example, a polymorphic Spl binding site in the collagen type I alpha 1 
gene has been associated with reduced bone density and osteoporosis [Grant et aL 9 
Nature Genet. 14:203-205 (1996)]. 

The Nature of the Putative Folate Genetic Risk Factors for Schizophrenia: Dd in the 
1 5 mother of a schizophrenia proband conferred significantly increased risk of 

schizophrenia in her child (Table 14). The findings that Dd was a genetic risk factor 
in mothers but not fathers of schizophrenia probands (Table 15) and that Dd in 
mothers gave a higher predicted probability than in schizophrenia patients, fathers or 
sibs (Tables 15 and 16) was consistent with the role of DHFR as a teratogenic locus 
20 according to the gene-teratogen model (described above). The finding that a double 
dose but not a single dose of the DHFR deletion in mothers was a genetic risk factor 
(Table 16) supported a recessive mode of action in the mother. A teratogenic locus 
acting in the mother can also act as a modifying or specificity locus in the fetus. 

Neither Ms nor Md in mothers of schizophrenia probands showed statistical 
25 significance as genetic risk factors for schizophrenia in probands (Table 14). However 
Md in mothers approached statistical significance (p= 065) and appeared to be 
protective (odds ratio 0.14), while Ms in mothers appeared to increase risk modestly 
(odds ratio 1.44, p=.34). 



Ik 
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Role of Genetic and Environmental Factors in Schizophrenia : Since the probability 
that a schizophrenia co-twin is also affected is reported [Gottesman, Schizophrenia 
Genesis, Schizophrenia Genesis- The Origins of Madness, W.H. Freeman & Co. 
N.Y.(1991)] to be only 48%, a large part of the risk for schizophrenia would be 
5 anticipated to come from environmental factors. Therefore, some controls should 
have the genetic risk factors for schizophrenia but not be affected with schizophrenia. 
In the present data set, 6 of 35 schizophrenia mothers and 7 of 38 schizophrenia 
patients had Dd Ms, the genotype conferring the highest risk, compared with 15 of 
Q 211 controls. Since this genotype gave predicted probabilities of schizophrenia in 

y5 10 probands of 0.29 and 0.24 respectively, polymorphisms of DHFR and MTHFR could 

y\ account for a considerable portion of the genetic component of the risk of 

fU schizophrenia. 

m 

^ Relation of DHFR to Cytogenetic and Linkage Data for Schizophrenia: As discussed 

yi above, the DHFR gene has been located on chromosome 5 at 5ql 1 .2-13.2. A 

Pi 15 schizophrenia translocation was reported (McGillivray et al.1990; Bassett, 1992) 

M affecting 5ql 1.2-5ql3.3. Also two-point lod scores of 4.64 and 2.29 were found 

[Sherrington et aL, Nature, 336: 164-167 (1988)] for the polymorphic markers D5S76 
and D5S39 respectively on chromosome 5, in this region [McGillivray et aL, Am. J. 
Med. Genet, 35:10-13 (1990); Bassett, Br. J. Psychiatry, 161:323-334 (1992)]. Two 
20 other linkage studies found small positive lod scores in this region [Coon et aL, BioL 
Psychiatry, 34:277-289 (1993); Kendler and Diehl, Schizophr. Bull., 19:261-285 
(1993)], but numerous other studies excluded this region under the assumptions and 
models used [Kendler and Diehl, Schizophr. BulL, 19:261-285 (1993)]. Recently, 
new studies have found suggestive evidence for a potential susceptibility locus at a 
25 different region of 5q, 5q31 [Schwab et aL, Nat. Genet. 11:325-327 (1997)] and 
5q22-31 [Straub et aL, Molec Psychiatr. 2:148-155 (1997)]. 



The case-control study presented herein illustrates the usefulness of the DNA 
polymorphism-Diet-Cofactor-Development and the gene-teratogen models described 
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above. More importantly, the results presented herein, clearly fail to reject the specific 
models, i.e., that folate gene polymorphisms can play a role in the etiology of 
schizophrenia. 

The present invention is not to be limited in scope by specific embodiments described 
herein. Indeed, various modifications of the invention in addition to those described 
herein will become apparent to those skilled in the art from the foregoing description 
and the accompanying figures. Such modifications are intended to fall within the 
scope of the appended claims. 

Various publications in addition to the immediately foregoing are cited herein, the 
disclosures of which are incorporated by reference in their entireties. 



